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Preface 


This textbook has grown out of the first-semester honors freshman physics course 
that has been taught at Harvard University during recent years. The book is essen¬ 
tially two books in one. Roughly half of it follows the form of a normal textbook, 
consisting of text, along with exercises suitable for homework assignments. The 
other half takes the form of a problem book, with all sorts of problems (with so¬ 
lutions) of varying degrees of difficulty. If you’ve been searching for a supply of 
practice problems to work on, this should keep you busy for a while. 

A brief outline of the book is as follows. Chapter 1 covers statics. Most of this 
will probably look familiar, but you’ll find some fun problems. In Chapter 2, we 
learn about forces and how to apply F = ma. There’s a bit of math here needed 
for solving some simple differential equations. Chapter 3 deals with oscillations 
and coupled oscillators. Again, there’s a fair amount of math needed for solving 
linear differential equations, but there’s no way to avoid it. Chapter 4 deals with 
conservation of energy and momentum. You’ve probably seen much of this before, 
but again, it has lots of neat problems. 

In Chapter 5, we introduce the Lagrangian method, which will undoubtedly be 
new to you. It looks rather formidable at first, but it’s really not all that rough. 
There are difficult concepts at the heart of the subject, but the nice thing is that 
the technique is easy to apply. The situation here analogous to taking a derivative 
in calculus; there are substantive concepts on which the theory rests, but the act of 
taking a derivative is fairly straightforward. 

Chapter 6 deals with central forces, Kepler’s Laws, and such things. Chapter 7 
covers the easier type of angular momentum situations, ones where the direction of 
the angular momentum is fixed. Chapter 8 covers the more difficult type, ones where 
the direction changes. Gyroscopes, spinning tops, and other fun and perplexing 
objects fall into this category. Chapter 9 deals with accelerated frames of reference 
and fictitious forces. 

Chapters 10 through 13 cover relativity. Chapter 10 deals with relativistic kine¬ 
matics - abstract particles flying through space and time. Chapter 11 covers rel¬ 
ativistic dynamics - energy, momentum, force, etc. Chapter 12 introduces the im¬ 
portant concept of “4-vectors.” The material in this chapter could alternatively 
be put in the previous two, but for various reasons I thought it best to create a 
separate chapter for it. Chapter 13 covers a few topics from general relativity. It’s 
not possible for one chapter to do this subject justice, of course, so we’ll just look 
at some basic (but still very interesting) examples. 

1 




2 


CONTENTS 


The appendices contain various useful things. Indeed, Appendices B and C, 
which cover dimensional analysis and limiting cases, are the first parts of this book 
you should read. 

Throughout the book, I have included many “remarks.” These are written in 
a slightly smaller font than the surrounding text. They begin with a small-capital 
“Remark” and end with a shamrock ($)• The purpose of these remarks is to say 
something that needs to be said, without disrupting the overall flow of the argument. 
In some sense these are “extra” thoughts, although they are invariably useful in 
understanding what is going on. They are usually more informal than the rest of 
the text, and I reserve the right to occasionally use them to babble about things 
I find interesting, but which you may find a bit tangential. For the most part, 
however, the remarks address issues and questions that arise naturally in the course 
of the discussion. 

At the end of the solutions to many problems, the obvious thing to do is to 
check limiting cases. 1 I have written these in a smaller font, but I have not always 
bothered to start them with a “Remark” and end them with a “Jfr”, because they 
are not “extra” thoughts. Checking limiting cases of your answer is something you 
should always do. 

For your reading pleasure (I hope), I have included many limericks scattered 
throughout the text. I suppose that they might be viewed as educational, but they 
certainly don’t represent any deep insight I have on the teaching of physics. I have 
written them solely for the purpose of lightening things up. Some are funny. Some 
are stupid. But at least they’re all physically accurate (give or take). 

A word on the problems. Some are easy, but many are very difficult. I think 
you’ll find them quite interesting, but don’t get discouraged if you have trouble 
solving them. Some are designed to be brooded over for hours. Or days, or weeks, 
or months (as I can attest to). I have chosen to write them up for two reasons: (1) 
Students invariably want extra practice problems, with solutions, to work on, and 
(2) I find them rather fun. 

The problems are marked with a number of asterisks. Harder problems earn 
more asterisks, on a scale from zero to four. You may, of course, disagree with 
my judgment of difficulty, but I think that an arbitrary weighting scheme is better 
than none at all. As a rough idea of what I mean by the number of stars: one-star 
problems are solid problems that require some thought, and four-star problems are 
really really really difficult. Try a few and you’ll see what I mean. 

Just to warn you, even if you understand the material in the text backwards and 
forwards, the four-star (and many of the three-star) problems will still be extremely 
challenging. But that’s how it should be. My goal was to create an unreachable 
upper bound on the number (and difficulty) of problems, because it would be an 
unfortunate circumstance, indeed, if you were left twiddling your thumbs, having 
run out of problems to solve. I hope I have succeeded. 

For the problems you choose to work on, be careful not to look at the solution 
too soon. There is nothing wrong with putting a problem aside for a while and 

1 This topic is discussed in Appendix C. 
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coming back to it later. Indeed, this is probably the best way to approach things. If 
you head to the solution at the first sign of not being able to solve a problem, then 
you have wasted the problem. 

Remark: This gives me an opportunity for my first remark (and first limerick, too). 
One thing many people don’t realize is that you need to know more than the correct way(s) to 
do a problem; you also need to be familiar with many incorrect ways of doing it. Otherwise, 
when you come upon a new problem, there may be a number of decent-looking approaches 
to take, and you won’t be able to immediately weed out the poor ones. Struggling a bit 
with a problem invariably leads you down some wrong paths, and this is an essential part 
of learning. To understand something, you not only have to know what’s right about the 
right things; you also have to know what’s wrong about the wrong things. Learning takes a 
serious amount of effort, many wrong turns, and a lot of sweat. Alas, there are no short-cuts 
to understanding physics. 

The ad said, For one little fee, 

You can skip all that course-work ennui. 

So send your tuition, 

For boundless fruition! 

Get your mail-order physics degree! £ 

One last note: the problems with included solutions are called “Problems.” The 
problems without included solutions are called “Exercises.” There is no fundamental 
difference between the two, except for the existence of written-up solutions. 

I hope you enjoy the book! 


— David Morin 
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Chapter 1 

Statics 

Copyright 2004 by David Morin, morin@physics.harvard.edu 


Before reading any of the text in this book, you should read Appendices B and C. 
The material discussed there (dimensional analysis, checking limiting cases, etc.) is 
extremely important. It’s fairly safe to say that an understanding of these topics is 
absolutely necessary for an understanding of physics. And they make the subject a 
lot more fun, too! 

For many of you, the material in this first chapter will be mainly review. As such, 
the text here will be relatively short. This is an “extra” chapter. Its main purpose 
is that it provides me with an excuse to give you some nice statics problems. Try 
as many as you like, but don’t go overboard; more important and relevant material 
will soon be at hand. 


1.1 Balancing forces 

A “static” situation is one where all the objects are motionless. If an object remains 
motionless, then F = ma tells us that the total force acting on it must be zero. 
(The converse is not true, of course. The total force on an object is also zero if 
it moves with constant nonzero velocity. But we’ll deal only with statics problems 
here). The whole goal in a statics problem is to find out what the various forces have 
to be so that there is zero net force acting on each object (and zero net torque, too, 
but that’s the topic of the next section). Since a force is a vector, this goal involves 
breaking the force up into its components. You can pick cartesian coordinates, polar 
coordinates, or another set. It is usually clear from the problem which system will 
make your calculations easiest. Once you pick a system, you simply have to demand 
that the total force in each direction is zero. 

There are many different types of forces in the world, most of which are large- 
scale effects of complicated things going on at smaller scales. For example, the 
tension in a rope comes from the chemical bonds that hold the molecules in the rope 
together (and these chemical forces are just electrical forces). In doing a mechanics 
problem involving a rope, there is certainly no need to analyze all the details of the 
forces taking place at the molecular scale. You simply call the force in the rope a 
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“tension” and get on with the problem. Four types of forces come up repeatedly: 

Tension 

Tension is the general name for a force that a rope, stick, etc., exerts when it is 
pulled on. Every piece of the rope feels a tension force in both directions, except 
the end point, which feels a tension on one side and a force on the other side from 
whatever object is attached to the end. 

In some cases, the tension may vary along the rope. The “Rope wrapped around 
a pole” example at the end of this section is a good illustration of this. In other 
cases, the tension must be the same everywhere. For example, in a hanging massless 
rope, or in a massless rope hanging over a frictionless pulley, the tension must be 
the same at all points, because otherwise there would be a net force on at least one 
tiny piece, and then F = ma would yield an infinite acceleration for this tiny piece. 

Normal force 

This is the force perpendicular to a surface that the surface applies to an object. 
The total force applied by a surface is usually a combination of the normal force and 
the friction force (see below). But for frictionless surfaces such as greasy ones or 
ice, only the normal force exists. The normal force comes about because the surface 
actually compresses a tiny bit and acts like a very rigid spring. The surface gets 
squashed until the restoring force equals the force the object applies. 

Remark: For the most part, the only difference between a “tension” and a “normal 
force” is the direction of the force. Both situations can be modeled by a spring. In the 
case of a tension, the spring (a rope, a stick, or whatever) is stretched, and the force on 
the given object is directed toward the spring. In the case of a normal force, the spring is 
compressed, and the force on the given object is directed away from the spring. Things like 
sticks can provide both normal forces and tensions. But a rope, for example, has a hard 
time providing a normal force. 

In practice, in the case of elongated objects such as sticks, a compressive force is usually 
called a “compressive tension,” or a “negative tension,” instead of a normal force. So by 
these definitions, a tension can point either way. At any rate, it’s just semantics. If you use 
any of these descriptions for a compressed stick, people will know what you mean. £ 

Friction 

Friction is the force parallel to a surface that a surface applies to an object. Some 
surfaces, such as sandpaper, have a great deal of friction. Some, such as greasy ones, 
have essentially no friction. There are two types of friction, called “kinetic” friction 
and “static” friction. 

Kinetic friction (which we won’t cover in this chapter) deals with two objects 
moving relative to each other. It is usually a good approximation to say that the 
kinetic friction between two objects is proportional to the normal force between 
them. The constant of proportionality is called p*, (the “coefficient of kinetic fric¬ 
tion”), where p*, depends on the two surfaces involved. Thus, F = m-N■ where N 
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is the normal force. The direction of the force is opposite to the motion. 

Static friction deals with two objects at rest relative to each other. In the static 
case, we have F < g s N (where g s is the “coefficient of static friction”). Note the 
inequality sign. All we can say prior to solving a problem is that the static friction 
force has a maximum value equal to F max = g s N. In a given problem, it is most 
likely less than this. For example, if a block of large mass M sits on a surface 
with coefficient of friction g s , and you give the block a tiny push to the right (tiny 
enough so that it doesn’t move), then the friction force is of course not equal to 
/ 'j, s N = n.sMg to the left. Such a force would send the block sailing off to the left. 
The true friction force is simply equal and opposite to the tiny force you apply. 
What the coefficient g s tells us is that if you apply a force larger than g s Mg (the 
maximum friction force on a horizontal table), then the block will end up moving 
to the right. 


Gravity 

Consider two point objects, with masses M and m, separated by a distance R. New¬ 
ton’s gravitational force law says that the force between these objects is attractive 
and has magnitude F = GMrn/R 2 , where G = 6.67 • 10 -11 m 3 /(kg • s 2 ). As we 
will show in Chapter 4, the same law applies to spheres. That is, a sphere may be 
treated like a point mass located at its center. Therefore, an object on the surface 
of the earth feels a gravitational force equal to 

F = m ( y G ^)= m 9^ ( L1 ) 


where M is the mass of the earth, and R is its radius. This equation defines g. 
Plugging in the numerical values, we obtain (as you can check) g « 9.8 m/s 2 . Every 
object on the surface of the earth feels a force of mg downward. If the object is not 
accelerating, then there must also be other forces present (normal forces, etc.) to 
make the total force equal to zero. 


Example (Block on a plane): A block of mass M rests on a fixed plane inclined 
at angle 0. You apply a horizontal force of Mg on the block, as shown in Fig. 1.1. 

(a) Assume that the friction force between the block and the plane is large enough 
to keep the block at rest. What are the normal and friction forces (call them N 
and Ff ) that the plane exerts on the block? 

(b) Let the coefficient of static friction be g. For what range of angles 0 will the 
block remain still? 



Figure 1.1 


Solution: 

(a) We will break the forces up into components parallel and perpendicular to the 
plane. (The horizontal and vertical components would also work, but the calcu¬ 
lation would be a little longer.) The forces are N, Ff, the applied Mg, and the 
weight Mg, as shown in Fig. 1.2. Balancing the forces parallel and perpendic- 



Figure 1.2 
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Solution: Consider a small piece of the rope that subtends an angle dO. Let the 
tension in this piece be T (which will vary slightly over the small length). As shown in 
Fig. 1.3, the pole exerts a small outward normal force, N d o, on the piece. This normal 
force exists to balance the inward components of the tensions at the ends. These 
inward components have magnitude T sin(dd/2). Therefore, N d e = 2Tsin(dd/2). 
The small-angle approximation, sin x w x, then allows us to write this as Nao = T dO. 
The friction force on the little piece of rope satisfies F de < /iN rW = pT dO. This 
friction force is what gives rise to the difference in tension between the two ends of 
the piece. In other words, the tension, as a function of 6, satisfies 



Figure 1.3 


T{6 + dff) 
=> dT 



InT 

T 


T{6) 4 - (iT dO 

pTd6 

J pd6 

fiO + C 

r 0 e**, 


(1.7) 


where we have used the fact that T = To when 9 = 0. 

The exponential behavior here is quite strong (as exponential behaviors tend to be). 
If we let n = 1, then just a quarter turn around the pole produces a factor of (AI 2 s» 5. 
One full revolution yields a factor of e 27r ss 530, and two full revolutions yield a factor 
of e 4w » 300,000. Needless to say, the limiting factor in such a case is not your 
strength, but rather the structural integrity of the pole around which the rope winds. 


1.2 Balancing torques 

In addition to balancing forces in a statics problem, we must also balance torques. 
We’ll have much more to say about torque in Chapters 7 and 8, but we’ll need one 
important fact here. 

Consider the situation in Fig. 1.4, where three forces are applied perpendicularly 
to a stick, which is assumed to remain motionless. F\ and F 2 are the forces at the 
ends, and F 3 is the force in the interior. We have, of course, F 3 = F\ + F 2 , because 
the stick is at rest. 

Claim 1.1 If the system is motionless, then F 3 a = F 2 (a + 6). In other words, the 
torques (force times distance) around the left end cancel. And you can show that 
they cancel around any other point, too. 



F 3 

Figure 1.4 


We’ll prove this claim in Chapter 7 by using angular momentum, but let’s give a 
short proof here. 

Proof: We’ll make one reasonable assumption, namely, that the correct relationship 
between the forces and distances is of the form, 


F 3 f(a) = F 2 f(a + b), 


( 1 . 8 ) 
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a b 

Figure 1.5 



asin0 a &sin06 

Figure 1.6 


where f(x ) is a function to be determined . 1 Applying this assumption with the roles 
of “left” and “right” reversed in Fig. 1.4, we have 

F 3 f(b) = F 1 f(a + b ) (1.9) 

Adding the two preceding equations, and using F 3 = F\ + F 2 , gives 

f(a) + f(b) = f(a + b). (1.10) 

This equation implies that f(nx) = nf(x ) for any x and for any rational number 
n, as you can show. Therefore, assuming f(x) is continuous, it must be the linear 
function, f(x) = Ax, as we wanted to show. The constant A is irrelevant, because 
it cancels in eq. ( 1 . 8). 2 ■ 

Note that dividing eq. (1.8) by eq. (1.9) gives F\f(a) = F 2 /( 6 ). and hence 
F\a = F 2 6 , which says that the torques cancel around the point where F 3 is applied. 
You can show that the torques cancel around any arbitrary pivot point. 

When adding up all the torques in a given physical setup, it is of course required 
that you use the same pivot point when calculating each torque. 

In the case where the forces aren’t perpendicular to the stick, the claim applies to 
the components of the forces perpendicular to the stick. This makes sense, because 
the components parallel to the stick have no effect on the rotation of the stick around 
the pivot point. Therefore, referring to the figures shown below, the equality of the 
torques can be written as 

F a asm6 a = F^bsmOb. (1-11) 

This equation can be viewed in two ways: 

• (F a sin 6 a )a = (T), sin 65 ) 6 . In other words, we effectively have smaller forces 
acting on the given “lever-arms” (see Fig. 1.5). 

• F 0 (asin 0 O ) = 1 * 5(6 sin 0 h ). In other words, we effectively have the given forces 
acting on smaller “lever-arms” (see Fig. 1.6). 

Claim 1.1 shows that even if you apply just a tiny force, you can balance the 
torque due to a very large force, provided that you make your lever-arm sufficiently 
long. This fact led a well-known mathematician of long ago to claim that he could 
move the earth if given a long enough lever-arm. 

One morning while eating my Wheaties, 

I felt the earth move ‘neath my feeties. 

The cause for alarm 
Was a long lever-arm, 

At the end of which grinned Archimedes! 

1 What we’re doing here is simply assuming linearity in F. That is, two forces of F applied at a 
point should be the same as a force of 2 F applied at that point. You can’t really argue with that. 

2 Another proof of this claim is given in Problem 12. 
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One handy fact that conies up often is that the gravitational torque on a stick 
of mass M is the same as the gravitational torque due to a point-mass M located at 
the center of the stick. The truth of this statement relies on the fact that torque is 
a linear function of the distance to the pivot point (see Exercise 7). More generally, 
the gravitational torque on an object of mass M may be treated simply as the 
gravitational torque due to a force Mg located at the center of mass. 

We’ll have much more to say about torque in Chapters 7 and 8, but for now 
we’ll simply use the fact that in a statics problem, the torques around any given 
point must balance. 


Example (Leaning ladder): A ladder leans against a frictionless wall. If the 
coefficient of friction with the ground is g, what is the smallest angle the ladder can 
make with the ground and not slip? 


Solution: Let the ladder have mass m and length l. As shown in Fig. 1.7, we have 
three unknown forces: the friction force, F, and the normal forces, N- t and N 2 . And 
we fortunately have three equations that will allow us to solve for these three forces: 
EF vert = 0, EFhoriz = 0, and Er = 0. 

Looking at the vertical forces, we see that Mi = mg. And then looking at the 
horizontal forces, we see that N 2 = F. So we have quickly reduced the unknowns 
from three to one. 

We will now use Er = 0 to find N 2 (or F). But first we must pick the “pivot” point 
around which we will calculate the torques. Any stationary point will work fine, 
but certain choices make the calculations easier than others. The best choice for the 
pivot is generally the point at which the most forces act, because then the Er = 0 
equation will have the smallest number of terms in it (because a force provides no 
torque around the point where it acts, since the lever-arm is zero). 

In this problem, there are two forces acting at the bottom end of the ladder, so this is 
the best choice for the pivot. 3 Balancing the torques due to gravity and N 2 , we have 


N 2 £ sin 6 = mg(£/2) cos 9 @4 N 2 = . (1.12) 


This is also the value of the friction force F. The condition F < gN\ = i-irng therefore 
becomes 


mg 
2 tan 6 


gmg 


(1.13) 


Remarks: The factor of 1/2 in this answer comes from the fact that the ladder behaves 
like a point mass located halfway up. As an exercise, you can show that the answer for the 
analogous problem, but now with a massless ladder and a person standing a fraction / of 
the way up, is tan 0 > f/g. 

Note that the total force exerted on the ladder by the floor points up at an angle given by 
tan/3 = Ni/F = (mg)/(mg/2tan0) = 2tan0. We see that this force does not point along 
the ladder. There is simply no reason why it should. But there is a nice reason why it 
should point upward with twice the slope of the ladder. This is the direction that causes the 
lines of the three forces on the ladder to be concurrent, as shown in Fig. 1.8. 

3 But you should verify that other choices for the pivot, for example, the middle or top of the 
ladder, give the same result. 


N 2 



F 

Figure 1.7 


N 2 



Figure 1.8 



1-8 


CHAPTER 1. STATICS 


This concurrency is a neat little theorem for statics problems involving three forces. The 
proof is simple. If the three lines weren’t concurrent, then one force would produce a nonzero 
torque around the intersection point of the other two lines of force. 4 4i 


Statics problems often involve a number of decisions. If there are various parts 
to the system, then you must decide which subsystems you want to balance the 
forces and torques on. And furthermore, you must decide which point to use as the 
origin for calculating the torques. There are invariably many choices that will give 
you the information you need, but some will make your calculations much cleaner 
than others (Exercise 11 is a good example of this). The only way to know how to 
choose wisely is to start solving problems, so you may as well tackle some... 


4 The one exception to this reasoning is where no two of the lines intersect; that is, where all 
three lines are parallel. Equilibrium is certainly possible in such a scenario, as we saw in Claim 1.1. 
Of course, you can hang onto the concurrency theorem in this case if you consider the parallel lines 
to meet at infinity. 
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1.3 Exercises 

Section 1.1 Balancing forces 

1. Pulling a block * 

A person pulls on a block with a force F, at an angle 9 with respect to the 
horizontal. The coefficient of friction between the block and the ground is /j 
F or what 9 is the F required to make the block slip a minimum? 

2. Bridges ** 

(a) Consider the first bridge in Fig. 1.9, made of three equilateral triangles 
of beams. Assume that the seven beams are massless and that the con¬ 
nection between any two of them is a hinge. If a car of mass m is located 
at the middle of the bridge, find the forces (and specify tension or com¬ 
pression) in the beams. Assume that the supports provide no horizontal 
forces on the bridge. 

(b) Same question, but now with the second bridge in Fig. 1.9, made of seven 
equilateral triangles. 

(c) Same question, but now with the general case of 4n — 1 equilateral trian¬ 
gles. 

3. Keeping the book up * 

The task of Problem 4 is to find the minimum force required to keep a book 
up. What is the maximum allowable force? Is there a special angle that arises? 

Given ft. make a rough plot of the allowed values of F for — ir/2 < 0 < tt/2. 

4. Rope between inclines ** 

A rope rests on two platforms that are both inclined at an angle 9 (which you 
are free to pick), as shown in Fig. 1.10. The rope has uniform mass density, 
and its coefficient of friction with the platforms is 1. The system has left-right 
symmetry. What is the largest possible fraction of the rope that does not 
touch the platforms? What angle 9 allows this maximum value? 

5. Hanging chain ** 

A chain of mass M hangs between two walls, with its ends at the same height. 

The chain makes an angle of 9 with each wall, as shown in Fig. 1.11. Find 
the tension in the chain at the lowest point. Solve this by: 

(a) Considering the forces on half of the chain. (This is the quick way.) Figure 1.11 

(b) Using the fact that the height of a hanging chain is given by y(x) = 

(1/a) cosh(cuc), and considering the vertical forces on an infinitesimal 
piece at the bottom. (This is the long way.) 
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Figure 1.12 



Figure 1.13 



hand 

Figure 1.14 



Section 1.2: Balancing torques 

6. Direction of the force * 

A stick is connected to other parts of a system by hinges at its ends. Show 
that if the stick is massless, then the forces it feels at the hinges are directed 
along the stick; but if the stick has mass, then the forces need not point along 
the stick. 

7. Gravitational torque * 

A horizontal stick of mass M and length L is pivoted at one end. Integrate 
the gravitational torque along the stick (relative to the pivot), and show that 
the result is the same as the torque due to a mass M located at the center of 
the stick. 

8. Tetherball * 

A ball is held up by a string, as shown in Fig. 1.12, with the string tangent 
to the ball. If the angle between the string and the wall is 9, what is the 
minimum coefficient of static friction between the ball and the wall, if the ball 
is not to fall? 

9. Ladder on a corner * 

A ladder of mass M and length L leans against a frictionless wall, with a 
quarter of its length hanging over a corner, as shown in Fig. 1.13. Assuming 
that there is sufficient friction at the corner to keep the ladder at rest, what 
is the total force that the corner exerts on the ladder? 

10. Stick on a corner * 

You hold one end of a stick of mass M and length L. A quarter of the way 
up the stick, it rests on a frictionless corner of a table, as shown in Fig. 1.14. 
The stick makes an angle 9 with the horizontal. What is the magnitude of the 
force your hand must apply, to keep the stick in this position? For what angle 
is the vertical component of your force equal to zero? 

11. Two sticks ** 

Two sticks, each of mass m and length I, are connected by a hinge at their 
top ends. They each make an angle 9 with the vertical. A massless string 
connects the bottom of the left stick to the right stick, perpendicularly, as 
shown in Fig. 1.15. The whole setup stands on a frictionless table. 

(a) What is the tension in the string? 

(b) What force does the left stick exert on the right stick at the hinge? Hint: 
No messy calculations required! 


Figure 1.15 
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12. Two sticks and a wall ** 

Two sticks are connected, with hinges, to each other and to a wall. The bottom 
stick is horizontal and has length L, and the sticks make an angle of 9 with 
each other, as shown in Fig. 1.16. If both sticks have the same mass per unit 
length, p, find the horizontal and vertical components of the force that the 
wall exerts on the top hinge, and show that the magnitude goes to infinity for 
both # —> 0 and 6 —► 7t/2. 5 

13. Stick on a circle ** 

Using the result from Problem 16 for the setup shown in Fig. 1.17, show that 
if the system is to remain at rest, then the coefficient of friction: 


(a) between the stick and the circle must satisfy 

sin# 

^ ~ (1 + cos#) ' 

(b) between the stick and the ground must satisfy 6 


(1.14) 


> sin 9 cos 9 
^ ~ (1 + cos#)(2 — cos 9) ' 


(1.15) 




Figure 1.17 


5 The force must therefore achieve a minimum at some intermediate angle. If you want to go 
through the algebra, you can show that this minimum occurs when cos# = \/3 — 1, which gives 

6 w 43°. 

6 If you want to go through the algebra, you can show that the maximum of the right-hand side 
occurs when cos 9 = y/3— I. which gives 0 « 43°. (Yes, I did just cut and paste this from the 
previous footnote. But it’s still correct!) This is the angle for which the stick is most likely to slip 
on the ground. 
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Figure 1.19 



Figure 1.20 




1.4 Problems 

Section 1.1: Balancing forces 

1. Hanging mass 

A mass m, held up by two strings, hangs from a ceiling, as shown in Fig. 1.18. 
The strings form a right angle. In terms of the angle 9 shown, what is the 
tension in each string? 

2. Block on a plane 

A block sits on a plane that is inclined at an angle 9. Assume that the friction 
force is large enough to keep the block at rest. What are the horizontal 
components of the friction and normal forces acting on the block? For what 9 
are these horizontal components maximum? 

3. Motionless chain * 

A frictionless planar curve is in the shape of a function which has its endpoints 
at the same height but is otherwise arbitrary. A chain of uniform mass per 
unit length rests on the curve from end to end, as shown in Fig. 1.19. Show, 
by considering the net force of gravity along the curve, that the chain will not 
move. 

4. Keeping the book up * 

A book of mass M is positioned against a vertical wall. The coefficient of 
friction between the book and the wall is fi. You wish to keep the book from 
falling by pushing on it with a force F applied at an angle 9 with respect to 
the horizontal (— n/2 < 9 < 7t/ 2), as shown in Fig. 1.20. For a given 9, what 
is the minimum F required? What is the limiting value of 9, below which 
there does not exist an F that will keep the book up? 

5. Objects between circles ** 

Each of the following planar objects is placed, as shown in Fig. 1.21, between 
two frictionless circles of radius R. The mass density of each object is a, and 
the radii to the points of contact make an angle 9 with the horizontal. For 
each case, find the horizontal force that must be applied to the circles to keep 
them together. For what 9 is this force maximum or minimum? 

(a) An isosceles triangle with common side length L. 

(b) A rectangle with height L. 

(c) A circle. 


Figure 1.21 
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6. Hanging rope 

A rope with length L and mass density p per unit length is suspended vertically 
from one end. Find the tension as a function of height along the rope. 

7. Rope on a plane * 

A rope with length L and mass density p per unit length lies on a plane 
inclined at angle 6 (see Fig. 1.22). The top end is nailed to the plane, and the 
coefficient of friction between the rope and plane is p. What are the possible 
values for the tension at the top of the rope? 



Figure 1.22 


8. Supporting a disk ** 

(a) A disk of mass M and radius R is held up by a massless string, as shown 
in Fig. 1.23. The surface of the disk is frictionless. What is the tension 
in the string? What is the normal force per unit length the string applies 
to the disk? 

(b) Let there now be friction between the disk and the string, with coefficient 
p. What is the smallest possible tension in the string at its lowest point? 

9. Hanging chain **** 

(a) A chain with uniform mass density per unit length hangs between two 
given points on two walls. Find the shape of the chain. Aside from 
an arbitrary additive constant, the function describing the shape should 
contain one unknown constant. 

(b) The unknown constant in your answer depends on the horizontal distance 
d between the walls, the vertical distance A between the support points, 
and the length i of the chain (see Fig. 1.24). Find an equation involving 
these given quantities that determines the unknown constant. 

10. Hanging gently ** 

A chain with uniform mass density per unit length hangs between two supports 
located at the same height, a distance 2d apart (see Fig. 1.25). What should 
the length of the chain be so that the magnitude of the force at the supports is 
minimized? You may use the fact that a hanging chain takes the form, y(x) = 
(1/a) cosh (ax'). You will eventually need to solve an equation numerically. 

11. Mountain Climber **** 

A mountain climber wishes to climb up a frictionless conical mountain. He 
wants to do this by throwing a lasso (a rope with a loop) over the top and 
climbing up along the rope. Assume that the climber is of negligible height, 
so that the rope lies along the mountain, as shown in Fig. 1.26. 

At the bottom of the mountain are two stores. One sells “cheap” lassos (made 
of a segment of rope tied to a loop of fixed length). The other sells “deluxe” 
lassos (made of one piece of rope with a loop of variable length; the loop’s 



Figure 1.23 





Figure 1.24 



1=2 

Figure 1.25 



Figure 1.26 
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deluxe 


Figure 1.27 
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Figure 1.28 



Figure 1.29 
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Figure 1.30 


length may change without any friction of the rope with itself). See Fig. 1.27. 
When viewed from the side, the conical mountain has an angle a at its peak. 
For what angles a can the climber climb up along the mountain if he uses: 

(a) a “cheap” lasso? 

(b) a “deluxe” lasso? 

Section 1.2: Balancing torques 

12. Equality of torques ** 

This problem gives another way of demonstrating Claim 1.1, using an inductive 
argument. We’ll get you started, and then you can do the general case. 
Consider the situation where forces F are applied upward at the ends of a 
stick of length £, and a force 2 F is applied downward at the midpoint (see 
Fig. 1.28). The stick will not rotate (by symmetry), and it will not translate 
(because the net force is zero). If we wish, we may consider the stick to have 
a pivot at the left end. If we then erase the force F on the right end and 
replace it with a force 2 F at the middle, then the two 2 F forces in the middle 
will cancel, so the stick will remain at rest. 7 Therefore, we see that a force F 
applied at a distance £ from a pivot is equivalent to a force 2 F applied at a 
distance £/2 from the pivot, in the sense that they both have the same effect 
in cancelling out the rotational effect of the downwards 2 F force. 

Now consider the situation where forces F are applied upward at the ends, 
and forces F are applied downward at the £/3 and 2£/3 marks (see Fig. 1.29). 
The stick will not rotate (by symmetry), and it will not translate (because the 
net force is zero). Consider the stick to have a pivot at the left end. From 
the above paragraph, the force F at 2£/S is equivalent to a force 2 F at £/S. 
Making this replacement, we now have a total force of 3 F at the 1/3 mark. 
Therefore, we see that a force F applied at a distance £ is equivalent to a force 
3 F applied at a distance 1/3. 

Your task is to now use induction to show that a force F applied at a distance 
£ is equivalent to a force nF applied at a distance £/n. and to then argue why 
this demonstrates Claim 1.1. 

13. Find the force * 

A stick of mass M is held up by supports at each end, with each support 
providing a force oiMg/2. Now put another support somewhere in the middle, 
say, at a distance a from one support and b from the other; see Fig. 1.30. 
What forces do the three supports now provide? Can you solve this? 


7 There will now be a different force applied at the pivot, namely zero, but the purpose of the 
pivot is to simply apply whatever force is necessary to keep the left end motionless. 
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14. Leaning sticks * 

One stick leans on another as shown in Fig. 1.31. A right angle is formed 
where they meet, and the right stick makes an angle 9 with the horizontal. 
The left stick extends infinitesimally beyond the end of the right stick. The 
coefficient of friction between the two sticks is p,. The sticks have the same 
mass density per unit length and are both hinged at the ground. What is the 
minimum angle 9 for which the sticks do not fall? 


Figure 1.31 


15. Supporting a ladder * 

A ladder of length L and mass M has its bottom end attached to the ground 
by a pivot. It makes an angle 9 with the horizontal, and is held up by a 
massless stick of length i which is also attached to the ground by a pivot (see 
Fig. 1.32). The ladder and the stick are perpendicular to each other. Find the 
force that the stick exerts on the ladder. 

16. Stick on a circle ** 

A stick of mass density p per unit length rests on a circle of radius R (see 
Fig. 1.33). The stick makes an angle 9 with the horizontal and is tangent 
to the circle at its upper end. Friction exists at all points of contact, and 
assume that it is large enough to keep the system at rest. Find the friction 
force between the ground and the circle. 

17. Leaning sticks and circles *** 

A large number of sticks (with mass density p per unit length) and circles 
(with radius R) lean on each other, as shown in Fig. 1.34. Each stick makes 
an angle 9 with the horizontal and is tangent to a circle at its upper end. The 
sticks are hinged to the ground, and every other surface is frictionless (unlike 
in the previous problem). In the limit of a very large number of sticks and 
circles, what is the normal force between a stick and the circle it rests on, very 
far to the right? (Assume that the last circle leans against a wall, to keep it 
from moving.) 

18. Balancing the stick ** 



Figure 1.32 



Figure 1.33 


/QOO 

Figure 1.34 


Given a semi-infinite stick (that is, one that goes off to infinity in one direc¬ 
tion), determine how its density should depend on position so that it has the 
following property: If the stick is cut at an arbitrary location, the remaining 
semi-infinite piece will balance on a support that is located a distance l from 
the end (see Fig. 1.35). 


H-/-H 

■"—s- 

Figure 1.35 


19. The spool ** 

A spool consists of an axle of radius r and an outside circle of radius R which 
rolls on the ground. A thread is wrapped around the axle and is pulled with 
tension T, at an angle 9 with the horizontal (see Fig. 1.36). 



Figure 1.36 
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(a) Given R and r, what should 6 be so that the spool does not move? 
Assume that the friction between the spool and the ground is large enough 
so that the spool doesn’t slip. 

(b) Given R, r, and the coefficient of friction // between the spool and the 
ground, what is the largest value of T for which the spool remains at 
rest? 

(c) Given R and /j, what should r be so that you can make the spool slip 
with as small a T as possible? That is, what should r be so that the 
upper bound on T from part (b) is as small as possible? What is the 
resulting value of T? 
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1.5 Solutions 

1. Hanging mass 

Balancing the horizontal and vertical force components on the mass gives, respectively 
(see Fig. 1.37), 

Ti sin 9 = T 2 cos9, 

7i cos 9 + T 2 sin 9 = mg. (1.16) 



Figure 1.37 


Solving for T\ in the first equation, and substituting into the second equation, gives 


Ti=mgcos6, and T 2 = mgsinO. (1.17) 

As a double-check, these have the correct limits when 9 —> 0 or 9 —> ir/2. 

2. Block on a plane 

Balancing the forces shown in Fig. 1.38, wee see that F = mg sin 9 and N = mg cos 9. 
The horizontal components of these are FcosO = mg sin 9 cos 9 (to the right), and 
N sin 9 = mg cos 9 sin 9 (to the left). These are equal, as they must be, because the 
net horizontal force on the block is zero. To maximize the value of mg sin 9 cos 9, we 
can either take the derivative, or we can write it as (mg/2) sin 29, from which it is 
clear that the maximum occurs at 9 = 7 r/4. The maximum value is mg/2. 

3. Motionless chain 

Let the curve be described by the function f(x), and let it run from x = a to x = b. 
Consider a little piece of the chain between x and x + dx (see Fig. 1.39). The length 
of this piece is \/l + f 2 dx, and so its mass is p\J 1 + f' 2 dx, where p is the mass 
per unit length. T he comp onent of the gravitational acceleration along the curve is 
— <7 sin# = —gf/sj 1 + f' 2 , with positive corresponding to moving along the curve 
from a to b. The total force along the curve is therefore 




4. Keeping the book up 

The normal force from the wall is F cos 9, so the friction force holding the book up 
is at most pF cos 9. The other vertical forces on the book are the gravitational force, 
which is —Mg, and the vertical component of F, which is F sin 0. If the book is to 
stay up, we must have 

pF cos 9 + F sin9 — Mg > 0. (1.19) 

Therefore, F must satisfy 

„ Mg 

F > ---. 

p cos 9 + sin 9 


( 1 . 20 ) 
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Figure 1.40 



Figure 1.41 



There is no possible F that satisfies this condition if the right-hand side is infinite. 
This occurs when 

taxiO = — jjb. (1.21) 

If 9 is more negative than this, then it is impossible to keep the book up, no matter 
how hard you push. 

5. Objects between circles 


(a) Let N be the normal force between the circles and the triangle. The goal in this 
problem is to find the horizontal component of N, that is, N cos 9. 

From Fig. 1.40, we see that the upward force on the triangle from the normal 
forces is 2N sin 9. This must equal the weight of the triangle, which is ga times 
the area. Since the bottom angle of the isosceles triangle is 29, the top side has 
length 2 L sin 9, and the altitude to this side is L cos 9. So the area of the triangle 
is L 2 sin 9 cos 9. The mass is therefore <jL 2 sin 9 cos 9. Equating the weight with 
the upward component of the normal forces gives N = (gaL 2 /2) cos 9. The 
horizontal component of N is therefore 


N cos 9 


gaL 2 cos 2 9 
2 


( 1 . 22 ) 


This equals zero when 9 = n/2, and it increases as 9 decreases, even though the 
triangle is getting smaller. It has the interesting property of approaching the 
finite number gaL 2 / 2, as 9 —> 0. 

(b) In Fig. 1.41, the base of the rectangle has length 2R(1 — cos 9). Its mass is 
therefore a2RL(l - cos 9). Equating the weight with the upward component of 
the normal forces, 21V sin 9, gives N = gaRL( 1 — cos 9)/ sin 0. The horizontal 
component of N is therefore 


Wcos 

sin 9 

This equals zero for both 9 = n/2 and 9 = 0 (because 1 — cos 6 « 9 2 /2 goes to 
zero faster than sin 0 ss 9, for small 9). Taking the derivative to find where it 
reaches a maximum, we obtain (using sin 2 9=1 — cos 2 9), 


cos 3 9 — 2 cos 0 + 1 = 0. 


(1.24) 


Fortunately, there is an easy root of this cubic equation, namely cos 9=1, which 
we know is not the maximum. Dividing through by the factor (cos 9 — 1) gives 


cos 2 9 + cos# - 1 = 0. 
The roots of this quadratic equation are 


-1±VE 

2 ' 


(1.25) 


(1.26) 


We must choose the plus sign, because we need | cos 0 <1. So our answer is 
cos# = 0.618, which interestingly is the golden ratio. The angle 9 is « 51.8°. 

(c) In Fig. 1.42, the length of the hypotenuse shown is Rsec9, so the radius of 
the top circle is R(sec9 — 1). Its mass is therefore anR 2 (sec9 — l) 2 . Equating 


Figure 1.42 







1.5. SOLUTIONS 


1-19 


the weight with the upward component of the normal forces, 27V sin 0, gives 
N = ganB 2 (sec 0 — l) 2 /(2 sin 9). The horizontal component of N is therefore 


N cos 9 


ganR 2 cos 9 / 


(1.27) 


This equals zero when 9 = 0 (using cos 9 ~ 1 — 9 2 / 2 and sin 9 '« 9, for small 
9). For 9 —> 7t/2, it behaves like 1/cos 9. which goes to infinity. In this limit, 
N points almost vertically, but its magnitude is so large that the horizontal 
component still approaches infinity. 

6. Hanging rope 

Let T(y) be the tension as a function of height. Consider a small piece of the rope 
between y and y + dy (0 < y < L). The forces on this piece are T(y + dy) upward, 
T(y) downward, and the weight pgdy downward. Since the rope is at rest, we have 
T(y + dy) = T(y) + pgdy. Expanding this to first order in dy gives T'(y) = pg. The 
tension in the bottom of the rope is zero, so integrating from y = 0 up to a position 
y gives 

T(y) = pgy. (1.28) 

As a double-check, at the top end we have T(L) = pgL, which is the weight of the 
entire rope, as it should be. 

Alternatively, you can simply write down the answer, T(y) = pgy , by noting that the 
tension at a given point in the rope is what supports the weight of all the rope below 
it. 

7. Rope on a plane 

The component of the gravitational force along the plane is (pL)g sin 9, and the max¬ 
imum value of the friction force is pN = p(pL)g cos 9. Therefore, you might think 
that the tension at the top of the rope is pLg sin 9 — ppLg cos 9. However, this is not 
necessarily the value. The tension at the top depends on how the rope is placed on 
the plane. 

If, for example, the rope is placed on the plane without being stretched, the friction 
force will point upwards, and the tension at the top will indeed equal pLg sin 9 — 
ppLg cos 9. Or it will equal zero if ppLg cos 9 > pLg sin 9, in which case the friction 
force need not achieve its maximum value. 

If, on the other hand, the rope is placed on the plane after being stretched (or equiva¬ 
lently, it is dragged up along the plane and then nailed down), then the friction force 
will point downwards, and the tension at the top will equal pLg sin 9 + ppLg cos 0. 
Another special case occurs when the rope is placed on a frictionless plane, and then 
the coefficient of friction is “turned on” to p. The friction force will still be zero. 
Changing the plane from ice to sandpaper (somehow without moving the rope) won’t 
suddenly cause there to be a friction force. Therefore, the tension at the top will 
equal pLgsin9. 

In general, depending on how the rope is placed on the plane, the tension at the top 
can take any value from a maximum of pLg sin 9 + ppLg cos 9, down to a minimum 
of pLg sin 9 — ppLg cos 9 (or zero, whichever is larger). If the rope were replaced by 
a stick (which could support a compressive force), then the tension could achieve 
negative values down to pLg sin 9 — ppLg cos 9, if this happens to be negative. 
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8. Supporting a disk 

(a) The gravitational force downward on the disk is Mg, and the force upward is 
2 T. These forces must balance, so 


T 


Mg 

~Y 


(1.29) 


We can find the normal force per unit length that the string applies to the disk 
in two ways. 

First method: Let N d6 be the normal force on an arc of the disk that subtends 
an angle dO. Such an arc has length RdO, so N/R is the desired normal force 
per unit arclength. The tension in the string is constant because the string 
is massless, so N is constant, independent of 9. The upward component of 
the normal force is N dO cos 6, where 9 is measured from the vertical (that is, 
—7 t/2 < 9 < 7t/ 2 here). Since the total upward force is Mg, we must have 

r /2 

/ N cos 6 d9 = Mg. (1.30) 

J-n/2 

The integral equals 2N, so we find N = Mg/2. The normal force per unit 
length, N/R, is then Mg/2R. 

Second method: Consider the normal force, N d6, on a small arc of the disk 
that subtends and angle dO. The tension forces on each end of the corresponding 
small piece of string almost cancel, but they don’t exactly, because they point 
in slightly different directions. Their non-zero sum is what produces the normal 
force on the disk. From Fig. 1.43, we see that the two forces have a sum 
of 2Tsin(d#/2), directed inward. Since d6 is small, we can use sin a; « x to 
approximate this as T d9. Therefore, N dB = T d9, and so N = T. The normal 
force per unit arclength, N/R, then equals T/R. Using T = Mg/2 from eq. 
(1.29), we arrive at N/R = Mg/2R. 

(b) Let T{0 ) be the tension, as a function of 9, for —n/2 < 0 < n/2. T will depend 
on 9 now, because there is a tangential friction force. Most of the work for this 
problem was already done in the example at the end of Section 1.1. We will 
simply invoke the second line of eq. (1.7), which says that 8 

dT < gT d9. (1.31) 

Separating variables and integrating from the bottom of the rope up to an angle 
9 gives In {[T{9)/T{ 0)) < g.9. Exponentiating this gives 

T{9) < T(0)e^. (1.32) 

Letting 0 = n/2, and using T(n/2) = Mg/2, we have Mg/2 < T( 0)e ,Mr / 2 . We 
therefore see that the tension at the bottom point must satisfy 

T(0) > (1.33) 

8 This holds for 9 > 0. There would be a minus sign on the right-hand side if 9 < 0. But since 
the tension is symmetric around 9 = 0 in the case we’re concerned with, we’ll just deal with 9 > 0. 




1.5. SOLUTIONS 


1-21 


This minimum value of T(0) goes to Mg/2 as p —> 0, as it should. And it goes 
to zero as /i —> oo, as it should (imagine a very sticky surface, so that the friction 
force from the rope near 0 = -k/2 accounts for essentially all the weight). But 
interestingly, it doesn’t exactly equal zero, no matter now large g is. 

9. Hanging chain 

(a) Let the chain be described by the function y(x ), and let the tension be described 
by the function T(x). Consider a small piece of the chain, with endpoints at 
x and x + dx, as shown in Fig. 1.44. Let the tension at x pull downward at 
an angle 6-\ with respect to the horizontal, and let the tension at x + dx pull 
upward at an angle 02 with respect to the horizontal. Balancing the horizontal 
and vertical forces on the small piece of chain gives 

T(x + dx) cos 02 = T(a:)cos0i, 

T(x + dx) sin #2 = T(x) sinfli + 9 Pdx , (1.34) 

COS 0\ 

where p is the mass per unit length. The second term on the right-hand side is 
the weight of the small piece, because dx/ cos 0-\ (or dx/ cos 02 , which is essen¬ 
tially the same) is its length. We must now somehow solve these two differential 
equations for the two unknown functions, y(x) and T(x). There are various 
ways to do this. Here is one method, broken down into three steps. 

First step: Squaring and adding eqs. (1.34) gives 

(T(x + dx)) 2 = (T(x)) 2 + 2T(x)gptaxi0 1 dx + O(dx 2 ). (1.35) 

Writing T(x + dx) w T(x) + T'(x) dx, and using tan$i = dy/dx = y’, we can 
simplify eq. (1.35) to (neglecting second-order terms in dx) 

T' = gpy'. (1.36) 

Therefore, 

T = gpy + c i, (1.37) 

where c\ is a constant of integration. 


Second step: Let’s see what we can extract from the first equation in eqs. 
(1.34). Using 

cosdi = — - - =, and cos0 2 = — . - , (1.38) 

a/i + lu'/x))^ " y^l + dxffi 

and expanding things to first order in dx, the first of eqs. (1.34) becomes 


T + T'dx 

+ y”dx ) 2 


T 



(1.39) 


All of the functions here are evaluated at x, which we won’t bother writing. 
Expanding the first square root gives (to first order in dx) 


T + T'dxf y'y"dx\ 

y/l + W V i +y' 2 ) 



T{x+dx) / 

Zh. 



T(x) x x+dx 

Figure 1.44 


(1.40) 
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h- d 



-x 0 x = 0 d-x 0 

Figure 1.45 


To first order in dx this yields 

t/ = y'y” 

T 1+Y 2 ' 

Integrating both sides gives 

InT + C2 = ^ ln(l + y' 2 ), 

where C2 is a constant of integration. Exponentiating then gives 
clT 2 = l + y ' 2 , 

where C3 = e° 2 . 


(1.41) 


(1.42) 


(1.43) 


Third step: We will now combine eq. (1.43) with eq. (1.37) to solve for y(x). 
Eliminating T gives c 2 (gpy + c\) 2 = 1 +y' 2 . We can rewrite this is the somewhat 
nicer form, 

1 + y' 2 = a 2 (y + h) 2 , (1-44) 

where a = c$gp, and h = c\ /gp. At this point we can cleverly guess (motivated 
by the fact that 1 + sinh 2 2 = cosh 2 z) that the solution for y is given by 

y(x) + h = — cosha(x + a). (1-45) 


Or, we can separate variables to obtain 


dx = 


_ dy 

\f'q 2 iy + hf~- 1 ’ 


(1.46) 


and then use the fact that the integral of 1/ \Jz 2 — 1 is cosh 1 2, to obtain the 
same result. 

The shape of the chain is therefore a hyperbolic cosine function. The constant 
h isn’t too important, because it simply depends on where we pick the y = 0 
height. Furthermore, we can eliminate the need for the constant a if we pick 
x = 0 to be where the lowest point of the chain is (or where it would be, in the 
case where the slope is always nonzero). In this case, using eq. (1.45), we see 
that y’{ 0) = 0 implies a = 0, as desired. We then have (ignoring the constant 
h) the nice simple result, 

y(x) = — cosh(aa:). (1-47) 

a 

(b) The constant a can be determined from the locations of the endpoints and the 
length of the chain. As stated in the problem, the position of the chain may be 
described by giving (1) the horizontal distance d between the two endpoints, (2) 
the vertical distance A between the two endpoints, and (3) the length l of the 
chain, as shown in Fig. 1.45. Note that it is not obvious what the horizontal 
distances between the ends and the minimum point (which we have chosen as the 
x = 0 point) are. If A = 0, then these distances are simply d/2. But otherwise, 
they are not so clear. 

If we let the left endpoint be located at x = —xq, then the right endpoint is 
located at x = d—x 0. We now have two unknowns, xq and a. Our two conditions 
are 9 

y(d - x 0 ) - y(-x 0 ) = A, (1.48) 

9 We will take the right end to be higher than the left end, without loss of generality. 
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along with the condition that the length equals £, which takes the form (using 
eq- (1-47)) 


t = J ° y/l + y' 2 dx 

1 \d-xo 

= — sinh(ax) , (1-49) 

a I —xq 

where we have used ( d/dz ) cosh z = sinhz, and 1 + sinh 2 z = cosh 2 z. Writing 
out eqs. (1.48) and (1.49) explicitly, we have 

cosh [a(d — xq)) — cosh(—axo) = aX, 

sinh (a(d — xq)) — sinh(—axo) = at. (1.50) 


If we take the difference of the squares of these two equations, and use the 
hyperbolic identities cosh 2 x — sinh 2 x = 1 and cosh x cosh y — sinh x sinh y = 
cosh(x — y), we obtain 

2 — 2cosh(ad) = a 2 (A 2 — £ 2 ). (1.51) 


This is the desired equation that determines a. Given d, A, and £, we can 
numerically solve for a. Using a “half-angle” formula, you can show that eq. 
(1.51) may also be written as 

2 sinh(ad/2) = ay/P ~ A 2 . (1.52) 


Remark: Let’s check a couple limits. If A = 0 and l = d (that is, the chain forms 
a horizontal straight line), then eq. (1.52) becomes 2sinh(o:d/2) = ad. The solution 
to this is a = 0, which does indeed correspond to a horizontal straight line, because 
for small a, eq. (1.47) behaves like ax' 2, 1‘l (up to an additive constant), which varies 
slowly with x for small a. Another limit is where £ is much larger than both d and A. 
In this case, eq. (1-52) becomes 2sinh(ad/2) « at. The solution to this is a very large 
a, which corresponds to a “droopy” chain, because eq. (1.47) varies rapidly with x for 
large a. 4 


10. Hanging gently 

We must first find the mass of the chain by calculating its length. Then we must 
determine the slope of the chain at the supports, so we can find the components of 
the force there. 


Using the given 
function of x is 


information, y(x) = (1/a) cosh (ox), the slope of the chain as a 


- cosh(ax) I = sinh(ax). 


(1.53) 


The total length is therefore (using 1 + sinh 2 2 = cosh 2 z) 


£ 


J y/\ + y' 2 dx 
J cosh (ax) 


sinh(ad). 


(1.54) 
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chain 

Figure 1.46 


The weight of the rope is W = pig, where p is the mass per unit length. Each 
support applies a vertical force of W/2. This must equal Fsin #, where F is the 
total force at each support, and # is the angle it makes with the horizontal. Since 
tan# = y'(d) = sinh(ad), we see from Fig. 1.46 that sin# = tanh(ad). Therefore, 


1 / p# sinh(ad) \ 

tanh(ad) \ a ) 

= — cosh(ad). (1.55) 

a 

Taking the derivative of this (as a function of a), and setting the result equal to zero 
to find the minimum, gives 

tanh(ad) = — . (1.56) 

ad 

This must be solved numerically. The result is 


ad 


1.1997 = g. 


(1.57) 


We therefore have a = g/d, and so the shape of the chain that requires the minimum 

y(x) « ^ cosh (^P) . (1.58) 

From eqs. (1.54) and (1.57), the length of the chain is 

I =— sinh(?7) w (2.52)d. (1.59) 

V 

To get an idea of what the chain looks like, we can calculate the ratio of the height, 
h, to the width, 2d. 


= y(d) - 2/(0) 

2d 2d 

cosh(? 7 ) — 1 

2 g 

w 0.338. (1.60) 

We can also calculate the angle of the rope at the supports, using tan# = sinh(ad). 
This gives tan# = sinh g, and so # « 56.5°. 

Remark: We can also ask what shape the chain should take in order to minimize the 
horizontal or vertical component of F. 

The vertical component, F y , is simply half the weight, so we want the shortest possible chain, 
namely a horizontal one (which requires an infinite F.) This corresponds to a = 0. 

The horizontal component, F x , equals F cos #. From Fig. 1.46, we see that cos 9=1/ cosh(ad). 
Therefore, eq. (1.55) gives F x = pg/a. This goes to zero as a —> oo, which corresponds to a 
chain of infinite length, that is, a very “droopy” chain. £ 
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11. Mountain Climber 

(a) We will take advantage of the fact that a cone is “flat”, in the sense that we can 
make one out of a piece of paper, without crumpling the paper. 

Cut the cone along a straight line emanating from the peak and passing through 
the knot of the lasso, and roll the cone flat onto a plane. Call the resulting figure, 
which is a sector of a circle, S (see Fig. 1.47). If the cone is very sharp, then S 
will look like a thin “pie piece”. If the cone is very wide, with a shallow slope, 
then S will look like a pie with a piece taken out of it. 

Points on the straight-line boundaries of the sector S are identified with each 
other. Let P be the location of the lasso’s knot. Then P appears on each 
straight-line boundary, at equal distances from the tip of S. Let (3 be the angle 
of the sector S. 

The key to this problem is to realize that the path of the lasso’s loop must be 
a straight line on S, as shown by the dotted line in Fig. 1.47. (The rope will 
take the shortest distance between two points because there is no friction. And 
rolling the cone onto a plane does not change distances.) A straight line between 
the two identified points P is possible if and only if the sector S is smaller than 
a semicircle. The condition for a climbable mountain is therefore (3 < 180°. 
What is this condition, in terms of the angle of the peak, a? Let C denote a 
cross-sectional circle of the mountain, a distance d (measured along the cone) 
from the top. 10 A semicircular S implies that the circumference of C equals nd. 
This then implies that the radius of C equals d/2. Therefore, 



Figure 1.47 




a <60°. (1.61) 


This is the condition under which the mountain is climbable. In short, having 
a < 60° guarantees that there is a loop around the cone with shorter length 
than the distance straight to the peak and back. 


Remark: When viewed from the side, the rope will appear perpendicular to the side 
of the mountain at the point opposite the lasso’s knot. A common mistake is to assume 
that this implies that the climbable condition is a < 90°. This is not the case, because 
the loop does not lie in a plane. Lying in a plane, after all, would imply an elliptical 
loop. But the loop must certainly have a kink in it where the knot is, because there 
must exist a vertical component to the tension there, to hold the climber up. If we 
had posed the problem with a planar, triangular mountain, then the condition would 
have been a < 90°. 

(b) Use the same strategy as in part (a). Roll the cone onto a plane. If the mountain 
is very steep, then the climber’s position can fall by means of the loop growing 
larger. If the mountain has a shallow slope, the climber’s position can fall by 
means of the loop growing smaller. The only situation in which the climber will 
not fall is the one where the change in position of the knot along the mountain 
is exactly compensated by the change in length of the loop. 

In terms of the sector S' in a plane, this condition requires that if we move P a 
distance l up (or down) along the mountain, the distance between the identified 
points P must decrease (or increase) by l. In Fig. 1.47, we must therefore have 
an equilateral triangle, so j3 = 60°. 

10 We are considering such a circle for geometrical convenience. It is not the path of the lasso; see 
the remark below. 
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Figure 1.48 
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Figure 1.49 


What peak-angle a does this correspond to? As in part (a), let C be a cross- 
sectional circle of the mountain, a distance d (measured along the cone) from 
the top. Then (3 = 60° implies that the circumference of C equals (n/3)d. This 
then implies that the radius of C equals d/6. Therefore, 

sin(a/2)=^ = i =? a w 19°. (1.62) 

This is the condition under which the mountain is climbable. We see that there 
is exactly one angle for which the climber can climb up along the mountain. The 
cheap lasso is therefore much more useful than the fancy deluxe lasso (assuming, 
of course, that you want to use it for climbing mountains, and not, say, for 
rounding up cattle). 

Remark: Another way to see the /3 = 60° result is to note that the three directions 
of rope emanating from the knot must all have the same tension, because the deluxe 
lasso is one continuous piece of rope. They must therefore have 120° angles between 
themselves (to provide zero net force on the massless knot). This implies that /3 = 60° 
in Fig. 1.47. 


Further remarks: For each type of lasso, we can also ask the question: For what 
angles can the mountain be climbed if the lasso is looped N times around the top of 
the mountain? The solution here is similar to that above. 

For the “cheap” lasso of part (a), roll the cone N times onto a plane, as shown in 
Fig. 1.48 for N = 4. : . The resulting figure, Sn, is a sector of a circle divided into N 
equal sectors, each representing a copy of the cone. As above, Sn must be smaller 
than a semicircle. The circumference of the circle C (defined above) must therefore be 
less than nd/N. Hence, the radius of C must be less than d/2N. Thus, 

*4 «< 2 »‘(Zv)- < 1 - 63 > 

For the “deluxe” lasso of part (b), again roll the cone N times onto a plane. From the 
reasoning in part (b), we must have N/3 = 60°. The circumference of C must therefore 
be ird/3N, and so its radius must be d/6N. Therefore, 


sin(a/2) = 


d/m 


6 N 


a = 2 sin -1 ( ). 


(1.64) 


12. Equality of torques 

The proof by induction is as follows. Assume that we have shown that a force F 
applied at a distance d is equivalent to a force kF applied at a distance d/k, for all 
integers k up to n — I. We now want to show that the statement holds for k = n. 
Consider the situation in Fig. 1.49. Forces F are applied at the ends of a stick, and 
forces 2 F/(n — 1) are applied at the jl/n marks (for 1 < j < n — 1). The stick will 
not rotate (by symmetry), and it will not translate (because the net force is zero). 
Consider the stick to have a pivot at the left end. Replacing the interior forces by 
their equivalent ones at the I/n mark (see Fig. 1.49) gives a total force there equal to 

^(l + 2 + 3 + ... + (n-l)) = = nF . (1.65) 

We therefore see that a force F applied at a distance £ is equivalent to a force nF 
applied at a distance £/n, as was to be shown. 

We can now show that Claim 1.1 holds, for arbitrary distances a and b (see Fig. 1.50). 


Figure 1.50 
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Consider the stick to be pivoted at its left end, and let e be a tiny distance (small 
compared to a). Then a force F 3 at a distance a is equivalent to a force F :i (a/e) at a 
distance e. 11 But a force F 3 (a/e) at a distance e is equivalent to a force F 3 (a/e)(e/ (a+- 
b )) = F 3 a/(a + b) at a distance (a + b). This equivalent force at the distance (a + b) 
must cancel the force F 2 there, because the stick is motionless. Therefore, we have 
F 3 a/(a + b) = F 2 , which proves the claim. 

13. Find the force 

In Fig. 1.51, let the supports at the ends exert forces and F 2 , and let the support 
in the interior exert a force F. Then 


n r 

A T~ 


_ b_ 

| Mg 


1 


Figure 1.51 


F\ + F 2 + F = Mg. 


( 1 . 66 ) 


Balancing torques around the left and right ends gives, respectively, 

Fa + F 2 (a + b) = , 

Fb + F^a + b) = Mg 0 ^, (1.67) 

where we have used the fact that the stick can be treated as a point mass at its 
center. Note that the equation for balancing the torques around the center of mass is 
redundant; it is obtained by taking the difference of the two previous equations and 
then dividing by 2. And balancing torques around the middle pivot also takes the 
form of a linear combination of these equations, as you can show. 

It appears as though we have three equations and three unknowns, but we really have 
only two equations, because the sum of eqs. (1.67) gives eq. (1.66). Therefore, since 
we have two equations and three unknowns, the system is underdetermined. Solving 
eqs. (1.67) for F\ and F 2 in terms of F, we see that any forces of the form 


(Fi,F,F 2 ) 


(Mg Fb Mg 

(y i+v y 


( 1 . 68 ) 


are possible. In retrospect, it makes sense that the forces are not determined. By 
changing the height of the new support an infinitesimal distance, we can make F be 
anything from 0 up to Mg(a+b)/2b, which is when the stick comes off the left support 
(assuming b> a). 

14. Leaning sticks 

Let Mi be the mass of the left stick, and let M r be the mass of the right stick. Then 
Mi/M r = tan 0 (see Fig. 1.52). Let N and Ff be the normal and friction forces 
between the sticks. Ff has a maximum value of fxN. Balancing the torques on the 
left stick (around the contact point with the ground) gives 

N=^-s in <9. (1.69) 


Balancing the torques 
gives 


the right stick (around the contact point with the ground) 


Ff 


M r g 


(1.70) 


11 Technically, we can use the reasoning in the previous paragraph to say this only if a/e is an 
integer, but since a/e is very large, we can simply pick the closest integer to it, and there will be 
negligible error. 



Figure 1.52 
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The condition Ff < pN becomes 

M r cos 9 < pMi sin 9. (1-71) 

Using Mi/M r = tan 9, this becomes 

tan 2 9 > - . (1.72) 

This is the condition for the sticks not to fall. This answer checks in the two extremes: 
In the limit p —> 0, we see that 9 must be very close to 7t/2, which makes sense. And 
in the limit p —> oo (that is, very sticky sticks), we see that 9 can be very small, which 
also makes sense. 


15. Supporting a ladder 

Let F be the desired force. Note that F must be directed along the stick, because 
otherwise there would be a net torque on the (massless) stick relative to the pivot at 
its right end. This would contradict the fact that it is at rest. 

Look at torques on the ladder around the pivot at its bottom. The gravitational force 
provides a torque of Mg(L/2) cos 9, tending to turn it clockwise; and the force F 
from the stick provides a torque of F(£/tan9), tending to turn it counterclockwise. 
Equating these two torques gives 


MgL 

2 £ 


sin#. 


(1.73) 



F f 

Figure 1.53 


Remarks: F goes to zero as 9 —> 0, as it should. 12 And F increases to MgL/2£, as 0 —> ir/2, 
which isn’t so obvious (the required torque from the stick is very small, but its lever arm is 
also very small). However, in the special case where the ladder is exactly vertical, no force 
is required. You can see that our calculations above are not valid in this case, because we 
divided by cos#, which is zero when 9 = ir/2. 

The normal force at the pivot of the stick (which equals the vertical component of F, because 
the stick is massless) is equal to MgL sin# cos #/2£. This has a maximum value of MgL/Ai 
at # = 7r/4. & 

16. Stick on a circle 

Let N be the normal force between the stick and the circle, and let Ff be the friction 
force between the ground and the circle (see Fig. 1.53). Then we immediately see 
that the friction force between the stick and the circle is also Ff, because the torques 
from the two friction forces on the circle must cancel. 

Looking at torques on the stick around the point of contact with the ground, we 
have Mgcos9(L/2) = NL, where M is the mass of the stick and L is its length. 
Therefore, N = (Mg/2) cos 9. Balancing the horizontal forces on the circle then gives 
N sin 0 = Ff + Ff cos 9. So we have 


N sin 9 _ Mg sin 9 cos 9 
1 + cos 9 ~ 2(1 +cos#) 


(1.74) 


But M = pL , and from Fig. 1.53 we have L = R,j tan(#/2). Using the identity 
tan(#/2) = sin#/(l + cos#), we finally obtain 


F 


f = 2 P9 R cos#- 


(1.75) 


12 For # —> 0, we would need to lengthen the ladder with a massless extension, because the stick 
would have to be very far to the right to remain perpendicular to the ladder. 
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In the limit 0 —* 7 t/2 , Ff approaches zero, which makes sense. In the limit 0 —> 0 
(which corresponds to a very long stick), the friction force approaches pgR/ 2, which 
isn’t so obvious. 

17. Leaning sticks and circles 

Let Si be the fth stick, and let C t be the ith circle. The normal forces C, feels from Si 
and Si+i are equal in magnitude, because these two forces provide the only horizontal 
forces on the frictionless circle, so they must cancel. Let Ni be this normal force. 
Look at the torques on 5j +1 , relative to the hinge on the ground. The torques come 
from Ni, iVj+i, and the weight of S)+i. From Fig. 1.54, we see that A/ acts at 
a point which is a distance ,Rtan(0/2) away from the hinge. Since the stick has a 
length l?/tan(0/2), this point is a fraction tan 2 (0/2) up along the stick. Therefore, 
balancing the torques on Si +1 gives 

^Mg cos 9+ N tan 2 ^ = N i+1 . (1.76) 

N 0 is by definition 0, so we have A/ = (Mg/2) cos 9 (as in the previous problem). If 
we successively use eq. (1.76), we see that jVjj equals (Mg/2) cos0(l + tan 2 (0/2)), 
and N 3 equals (Mg/2) cos0(l + tan 2 (9/2) + tan 4 (0/2)), and so on. In general, 


• • + tan 2 ^ 1 ) 


(1.77) 


In the limit i —> oo, we may write this infinite geometric sum in closed form as 


lim Ni = TVoo 


Mg cos 9 [ 
2 ' 


1 — tan 2 (0/2) J 


(1.78) 


Note that this is the solution to eq. (1.76), with N, = N t+[ . So if a limit exists, it 
must equal this. Using M = pL = pR/ tan(0/2), we can rewrite N 00 as 


Noo 


pRg cos 0 ( 1 

2tan(0/2) \l-tan 2 (0/2) 


(1.79) 


The identity cos 9 = cos' 2 (9/2) — sin 2 (0/2) may then be used to write this 


Noo 


pRg cos 3 (0/2) 
2sin(0/2) 


(1.80) 


Remarks: N x goes to infinity for 0 —> 0, which makes sense, because the sticks are very 
long. All of the Ni are essentially equal to half the weight of a stick (in order to cancel the 
torque from the weight relative to the pivot). For 0 —> n/2, we see from eq. (1.80) that Noa 
approaches pRg/ 4, which is not at all obvious; the Ni start off at Ni = (Mg/2) cos 0 « 0, 
but gradually increase to pRg/ 4, which is a quarter of the weight of a stick. 

Note that the horizontal force that must be applied to the last circle far to the right is 
Noo sin0 = pRgcos i (9/2). This ranges from pRg for 0 —> 0, to pRg/ 4 for 0 —> n/2. X 
18. Balancing the stick 

Let the stick go off to infinity in the positive x direction, and let it be cut at x = xq. 
Then the pivot point is located at x = xq + £ (see Fig. 1.55). Let the density be p(x). 
The condition that the total gravitational torque relative to x 0 + l equal zero is 

T = y p(x)(x - (x Q +l))gdx = 0. 



R tan 0/2 


Figure 1.54 


A 

x 0 x 0 +l 

Figure 1.55 


(1.81) 



1-30 


CHAPTER 1. STATICS 



F f 

Figure 1.56 



Figure 1.57 


We want this to equal zero for all xo, so the derivative of r with respect to xq must 
be zero, r depends on Xq through both the limits of integration and the integrand. 
In taking the derivative, the former dependence requires finding the value of the 
integrand at the limits, while the latter dependence requires taking the derivative of 
the integrand with respect to xo , and then integrating. We obtain, using the fact that 
p( oo) = 0, 

f°° 

0 = —— = Ip(x o) — / p(x)dx. (1-82) 

dx 0 J XQ 

Taking the derivative of this equation with respect to xo gives 

Ip'{x 0 ) = -p(x 0 ). (1.83) 

The solution to this is (rewriting the arbitrary xo as x) 

p(x) = Ae~ x/e . (1.84) 

We therefore see that the density decreases exponentially with x. The smaller l is, 
the quicker it falls off. Note that the density at the pivot is 1/e times the density at 
the left end. And you can show that 1 — l/e«63%of the mass is contained between 
the left end and the pivot. 

19. The spool 

(a) Let Ff be the friction force the ground provides. Balancing the horizontal forces 
on the spool gives (see Fig. 1.56) 

T cos 6 = Ff. 

Balancing torques around the center of the spool gives 

Tr = F f R. 

These two equations imply 

cos 0=4- (1.87) 

R 

The niceness of this result suggests that there is a quicker way to obtain it. And 

indeed, we see from Fig. 1.57 that cos 9 = r/R is the angle that causes the line 

of the tension to pass through the contact point on the ground. Since gravity 
and friction provide no torque around this point, the total torque around it is 
therefore zero, and the spool remains at rest. 

(b) The normal force from the ground is 

N = Mg- T sin 6 >. (1.88) 


(1.85) 

( 1 . 86 ) 


Using eq. (1.85), the statement Ff < pN becomes T cos 9 < p(Mg 
Hence, 

T < P M 9 
~ cos 9 + p sin 0 ’ 


Tsin0). 

(1.89) 


where 9 is given in eq. (1.87). 

(c) The maximum value of T is given in (1.89). This depends on 9, which in turn 
depends on r. We want to find the r which minimizes this maximum T. 
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Taking the derivative with respect to 9, we find that the 6 that maximizes the 
denominator in eq. (1.89) is given by tan # 0 = A*- You can then show that the 
value of T for this 9q is 


A iMg 


= M g sin # 0 . 


(1.90) 


To find the corresponding r, we can use eq. (1.87) to write tan# = y/R 2 — r 2 /r. 
The relation tan 0q = g then yields 


ro = 



(1.91) 


This is the r that yields the smallest upper bound on T. In the limit /r = 0, we 
have #o = 0, To = 0, and ro = R. And in the limit g = oo, we have #o = 7t/2, 
T 0 = Mg, and ro = 0. 
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Using F = ma 

Copyright 2004 by David Morin, morin@physics.harvard.edu 


The general goal of classical mechanics is to determine what happens to a given set 
of objects in a given physical situation. In order to figure this out, we need to know 
what makes the objects move the way they do. There are two main ways of going 
about this task. The first way, which you are undoubtedly familiar with, involves 
Newton’s laws. This will be the subject of the present chapter. The second way, 
which is the more advanced one, is the Lagrangian method. This will be the subject 
of Chapter 5. 

It should be noted that each of these methods is perfectly sufficient for solving 
any problem. They both produce the same information in the end, but they are 
based on vastly different principles. We’ll talk more about this is Chapter 5. 


2.1 Newton’s Laws 

Newton published his three laws in 1687 in his Principia Mathematica. The laws are 
fairly intuitive, although it seems a bit strange to attach the adjective “intuitive” 
to a set of statements that took millennia for humans to write down. The laws may 
be stated as follows. 

• First Law: A body moves with constant velocity (which may be zero) unless 
acted on by a force. 

• Second Law: The time rate of change of the momentum of a body equals 
the force acting on the body. 

• Third Law: The forces two bodies apply to each other are equal in magnitude 
and opposite in direction. 

We could discuss for days on end the degree to which these statements are 
physical laws, and the degree to which they are definitions. Sir Arthur Eddington 
once made the unflattering comment that the first law essentially says that “every 
particle continues in its state of rest or uniform motion in a straight line except 
insofar that it doesn’t.” Although Newton’s laws might seem somewhat vacuous at 
first glance, there is actually a bit more content to them than Eddington’s comment 
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implies. Let’s look at each in turn. The discussion will be brief, because we have to 
save time for other things in this book that we really do want to discuss for days on 
end. 

First Law 

One thing this law does is give a definition of zero force. 

Another thing it does is give a definition of an inertial frame, which is defined 
simply as a reference frame in which the first law holds. Since the term “velocity” 
is used, we have to state what frame of reference we are measuring the velocity with 
respect to. The first law does not hold in an arbitrary frame. For example, it fails in 
the frame of a spinning turntable. 1 Intuitively, an inertial frame is one that moves 
at constant speed. But this is ambiguous, because we have to say what the frame is 
moving at constant speed with respect to. At any rate, an inertial frame is simply 
defined as the special type of frame where the first law holds. 

So, what we now have are two intertwined definitions of “force” and “inertial 
frame.” Not much physical content here. But, however sparse in content the law is, 
it still holds for all particles. So if we have a frame in which one free particle moves 
with constant velocity, than all free particles move with constant velocity. This is a 
statement with content. 

Second Law 

One thing this law does is give a definition of nonzero force. Momentum is defined 2 
to be mv. If to is constant, 3 then the second law says that 

F = ma, (2.1) 

where a = dv/dt. This law holds only in an inertial frame, which was defined by 
the first law. 

For things moving free or at rest, 

Observe what the first law does best. 

It defines a key frame, 

“Inertial” by name, 

Where the second law then is expressed. 

So far, the second law merely gives a definition of F. But the meaningful state¬ 
ment arises when we invoke the fact that the law holds for all particles. If the same 
force (for example, the same spring stretched by the same amount) acts on two 

1 It is, however, possible to modify things so that Newton’s laws hold in such a frame, but we’ll 
save this discussion for Chapter 9. 

2 We’re doing everything nonrelativistically here, of course. Chapter 11 gives the relativistic 
modification of the mv expression. 

3 We’ll assume in this chapter that m is constant. But don’t worry, we’ll get plenty of practice 
with changing mass (in rockets and such) in Chapter 4. 
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particles, with masses mi and m2, then eq. (2.1) says that their accelerations must 
be related by 


01 _ m 2 
o 2 mi ’ 


( 2 . 2 ) 


This relation holds regardless of what the common force is. Therefore, once you’ve 
used one force to find the relative masses of two objects, then you know what the 
ratio of their a’s will be when they are subjected to any other force. 

Of course, we haven’t really defined mass yet. But eq. (2.2) gives an experimen¬ 
tal method for determining an object’s mass in terms of a standard (say, 1 kg) mass. 
All you have to do is compare its acceleration with that of the standard mass, when 
acted on by the same force. 

There is also another piece of substance in this law, in that it says F = ma, 
instead of, say, F = mv, or F = md 3 x/dt 3 . This issue is related to the first law. 
F = mv is not viable, because the first law says that it is possible to have a velocity 
without a force. And F = md 3 x/dt 3 would make the first law incorrect, because 
it would then be true that a particle moves with constant acceleration (instead of 
constant velocity) unless acted on by a force. 

Note that F = ma is a vector equation, so it is really three equations in one. In 
Cartesian coordinates, it says that F x = ma x , F y = ma y , and F z = ma z . 


Third Law 

This law essentially postulates that momentum is conserved (that is, not dependent 
on time). To see this, note that 

dp _ G?(miVi + m 2 v 2 ) 
dt dt 

= miai + m 2 a 2 

= Fi+F 2 , (2.3) 

where F1 and F 2 are the forces acting on m\ and m 2 , respectively. This demonstrates 
that momentum conservation (that is, dp/dt = 0) is equivalent to Newton’s third 
law (that is, Fi = — F 2 .) 

There isn’t much left to be defined via this law, so the third law is one of pure 
content. It says that if you have two isolated particles interacting through some 
force, then their accelerations are opposite in direction and inversely proportional 
to their masses. 

This third law cannot be a definition, because it’s actually not always valid. It 
only holds for forces of the “pushing” and “pulling” type. It fails for the magnetic 
force, for example. In that case, momentum is carried off in the electromagnetic 
field (so the total momentum of the particles and the field is conserved). But we 
won’t deal with fields here. Just particles. So the third law will always hold in any 
situation we’re concerned with. 
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2.2 Free-body diagrams 

The law that allows us to be quantitative is the second law. Given a force, we 
can apply F = raa to find the acceleration. And knowing the acceleration, we can 
determine the behavior of a given object (that is, where it is and what its velocity is), 
provided that we are given the initial position and velocity. This process sometimes 
takes a bit of work, but there are two basic types of situations that commonly arise. 

• In many problems, all you are given is a physical situation (for example, a 
block resting on a plane, strings connecting masses, etc.), and it is up to you 
to find all the forces acting on all the objects. These forces generally point in 
various directions, so it is easy to lose track of them. It therefore proves useful 
to isolate the objects and draw all the forces acting on each of them. This is 
the subject of the present section. 

• In other problems, you are given the force explicitly as a function of time, 
position, or velocity, and the task immediately becomes the mathematical one 
of solving the F = ma = mx equation (we’ll just deal with one dimension 
here). These differential equations can be difficult (or impossible) to solve 
exactly. They are the subject of Section 2.3. 



Figure 2.1 



Figure 2.2 


Let’s now consider the first of these two types of scenarios, where we are pre¬ 
sented with a physical situation, and where we must determine all the forces in¬ 
volved. The term free-body diagram is used to denote a diagram with all the forces 
drawn on a given object. After drawing such a diagram for each object in the setup, 
we simply write down all the F = ma equations they imply. The result will be a 
system of linear equations in various unknown forces and accelerations, for which 
we must then solve. This procedure is best understood through an example. 


Example (A plane and masses): Mass Mi is held on a plane with inclination 
angle 9, and mass M 2 hangs over the side. The two masses are connected by a 
massless string which runs over a massless pulley (see Fig. 2.1). The coefficient of 
kinetic friction between Mi and the plane is y. Mi is released. Assuming that M 2 is 
sufficiently large so that Mi gets pulled up the plane, what is the acceleration of the 
masses? What is the tension in the string? 

Solution: The first thing to do is draw all the forces on the two masses. These are 
shown in Fig. 2.2. The forces on M 2 are gravity and the tension. The forces on Mi 
are gravity, friction, the tension, and the normal force. Note that the friction force 
points down the plane, because we are assuming that Mi moves up the plane. 
Having drawn all the forces, we now simply have to write down all the F = ma equa¬ 
tions. When dealing with Mi, we could break things up into horizontal and vertical 
components, but it is much cleaner to use the components along and perpendicular 
to the plane. 4 These two components of F = ma, along with the vertical F = ma 


4 When dealing with inclined planes, one of these two coordinate systems will generally work 
much better than the other. Sometimes it’s not clear which one, but if things get messy with one 
system, you can always try the other one. 
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equation for M 2 , give 


T — f — Mi <7 sin d 

= Mia, 


N — Mig cos 9 

= o, 


M 2 g — T 

= M 2 a, 

(2.4) 


where we have used the fact that the two masses accelerate at the same rate (and 
we have defined the positive direction for M 2 to be downward). We have also used 
the fact that tension is the same at both ends of the string, because otherwise there 
would be a net force on some part of the string which would then have to undergo 
infinite acceleration, because it is massless. 

There are four unknowns in eqs. (2.4) (namely T, a, N, and /), but only three 
equations. Fortunately, we have a fourth equation: / = gN. Using this in the 
second equation above gives / = gMig cos 9. The first equation then becomes T — 
/iMigcosO — M-| g sin 0 = M- t a. Adding this to the third equation leave us with only 
a, so we find 

_ g(M 2 — uMi cos9 — Mi sin9) ^ _ MiM 2 g(l + gcos9 + sin#) 

Mi + M 2 ’ Mi + M 2 

(2.5) 

Note that in order for Mi to move upward (that is, a > 0), we must have M 2 > 
Mi (g cos 9 + sin 9) . This is clear from looking at the forces along the plane. 


Remark: If we had instead assumed that Mi was sufficiently large so that it slides down 
the plane, then the friction force would point up the plane, and we would have found, as you 
can check, 


g(M 2 + gMi cos 6 - Mi sin 9) 
Mi + M 2 




T = 


M\M 2 g 
Mi + M 2 


(1 — geos 6 + sin#). 


( 2 . 6 ) 


In order for Mi to move downward (that is, a < 0), we must have M 2 < Mi (sin 9 — geos 9). 
Therefore, the range of M 2 for which the system doesn’t move is Mi (sin 6 — geos 0) < M 2 < 
Mi (sin 0 + g cos 6). & 


In problems like the one above, it is clear what things you should pick as the 
objects on which you’re going to draw forces. But in other problems, where there are 
various different subsystems you can choose, you must be careful to include all the 
relevant forces on a given subsystem. Which subsystems you want to pick depends 
on what quantities you’re trying to find. Consider the following example. 


Example (Platform and pulley): A person stands on a platform-and-pulley 
system, as shown in Fig. 2.3. The masses of the platform, person, and pulley 5 are 
M, to, and g, respectively. 6 The rope is massless. Let the person pull up on the rope 
so that she has acceleration a upwards. 7 

s Assume that the pulley’s mass is concentrated at its center, so that we don’t have to worry 
about any rotational dynamics (the subject of Chapter 7). 

6 My apologies for using g as a mass here, since it usually denotes a coefficient of friction. Alas, 
there are only so many symbols for “to” . 

7 Assume that the platform is somehow constrained to stay level, perhaps by having it run along 
some rails. 



M 

Figure 2.3 
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T T 



Figure 2.4 


(a) What is the tension in the rope? 

(b) What is the normal force between the person and the platform? What is the 
tension in the rod connecting the pulley to the platform? 

Solution: 

(a) To find the tension in the rope, we simply want to let our subsystem be the 
whole system (except the ceiling). If we imagine putting the system in a black 
box (to emphasize the fact that we don’t care about any internal forces within 
the system), then the forces we see “protruding” from the box are the three 
weights ( Mg , mg, and jig) downward, and the tension T upward. Applying 
F = ma to the whole system gives 

T-(M + m+ji)g = (M + m + g)a =► T = (M + m +g)(g + a). (2.7) 

(b) To find the normal force, N, between the person and the platform, and also the 
tension, /, in the rod connecting the pulley to the platform, it is not sufficient 
to consider the system as a whole. We must consider subsystems. 

• Let’s apply F = ma to the person. The forces acting on the person are 
gravity, the normal force from the platform, and the tension from the rope 
(pulling downward on her hand). Therefore, we have 

N — T — mg = ma. (2.8) 

• Now apply F = ma to the platform. The forces acting on the platform are 
gravity, the normal force from the person, and the force upward from the 
rod. Therefore, we have 

f — N — Mg = Ma. (2.9) 

• Now apply F = ma to the pulley. The forces acting on the pulley are 
gravity, the force downward from the rod, and twice the tension in the rope 
(because it pulls up on both sides). Therefore, we have 

2T — / — jig = jia. (2.10) 

Note that if we add up the three previous equations, we obtain the F = ma 
equation in eq. (2.7), as should be the case, because the whole system is the 
sum of the three above subsystems. Eqs. (2.8) - (2.10) are three equations in 
the three unknowns (T, N, and /). Their sum yields the T in (2.7), and then 
eqs. (2.8) and (2.10) give, respectively (as you can show), 

N = (M + 2to + fjb)(g + a), and f = (2M+ 2m +g,)(g + a). (2.11) 

Remark: You can also obtain these results by considering subsystems different from 
the ones we chose above. For example, you can choose the pulley-plus-platform sub¬ 
system, etc. But no matter how you choose to break up the system, you will need 
to produce three independent W «= ma statements in order to solve for the three 
unknowns, T, N, and /. 

In problems like this one, it is easy to forget to include one of the forces, such as the 
second T in eq. (2.10). The safest thing to do is to isolate each subsystem, draw a box 
around it, and then draw all the forces that “protrude” from the box. Fig. 2.4 shows 
the free-body diagram for the subsystem of the pulley. & 
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Another class of problems, similar to the previous example, goes by the name of 
Atwood’s machines. An Atwood’s machine is simply the name for any system that 
consists of a combination of masses, strings, and pulleys. In general, the pulleys and 
strings can have mass, but we’ll just deal with massless ones in this chapter. 

We’ll do one example here, but additional (and stranger) setups are given in the 
exercises and problems for this chapter. As we’ll see below, there are two basic steps 
in solving an Atwood’s problem: (1) Write down all the F = ma equations, and 
(2) Relate the accelerations of the various masses by noting that the length of the 
string doesn’t change (a fact that we’ll call “conservation of string”). 


Example (An Atwood’s machine): Consider the pulley system in Fig. 2.5, with 
masses mi and m 2 . The strings and pulleys are massless. What are the accelerations 
of the masses? What is the tension in the string? 

Solution: The first thing to note is that the tension, T, is the same everywhere 
throughout the massless string, because otherwise there would be infinite acceleration. 
It then follows that the tension in the short string connected to m 2 is 2 T. This is true 
because there must be zero net force on the massless right pulley, because otherwise 
it would have infinite acceleration. The F = ma equations on the two masses are 
therefore 


T — mig = m±ai, 

2 T — m 2 <ji = m 2 a 2 . (2-12) 

We now have two equations in the three unknowns, Oi, a 2 , and T. So we need one 
more equation. This is the “conservation of string” fact, which relates ai and o 2 . If 
we imagine moving m 2 and the right pulley up a distance d, then a length 2d of string 
has disappeared from the two parts of the string touching the right pulley. This string 
has to go somewhere, so it ends up in the part of the string touching m\. Therefore, 
mi goes down by a distance 2d. In other words, y-\ = —2y 2 (where y-\ and y 2 are 
measured relative to the initial locations of the masses). Taking two time derivatives 
of this statement gives our desired relation between ai and o 2 , 


ai = —2a 2 . (2-13) 

Combining this with eqs. (2.12), we can now solve for a\, a 2 , and T. The result is 

2m 2 - 4mi 2mi - m 2 3mim 2 y 

ai = g— -, a 2 = g- -, 7 — , -. (2.14) 

4mi + m 2 4mi + m 2 4mi + m 2 

Remark: There are all sorts of limits and special cases that we can check here. A few 
are: (1) If m 2 = 2mi, then eq. (2.14) gives ai = a 2 = 0, and T = mig. Everything is 
at rest. (2) If m 2 » mi, then eq. (2.14) gives 01 = 2 g, a 2 = — g, and T = 3mi<?. In 
this case, m 2 is essentially in free fall, while mi gets yanked up with acceleration 2 g. The 
value of T is exactly what is needed to make the net force on mi equal to mi(2g), because 
T — mig = 3mig — mig = mi(2g). We’ll let you check the case where mi 3> m 2 . & 



Figure 2.5 
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In the problems for this chapter, you’ll encounter some strange Atwood’s setups. 
But no matter how complicated they get, there are only two things you need to 
do to solve them, as mentioned above: (1) Write down the F = ma equations for 
all the masses (which may involve relating the tensions in various strings), and (2) 
relate the accelerations of the masses, using “conservation of string”. 

It may seem, with the angst it can bring, 

That an Atwood’s machine’s a harsh thing. 

But you just need to say 

That F is ma, 

And use conservation of string! 

2.3 Solving differential equations 

Let’s now consider the type of problem where we are given the force as a function 
of time, position, or velocity, and where our task is to solve the F = ma = mx 
differential equation to find the position, x(t), as a function of time. In what follows, 
we will develop a few techniques for solving differential equations. The ability to 
apply these techniques dramatically increases the number of problems we can solve. 

In general, the force F can also be a function of higher derivatives of x, in 
addition to the quantities t, x, and v = x. But these cases don’t arise much, so 
we won’t worry about them. The F = ma differential equation we want to solve is 
therefore (we’ll just work in one dimension here) 

mx- F(x,v,t). (2.15) 

In general, this equation cannot be solved exactly for x(t). 8 But for most of the 
problems we will deal with, it can be solved. The problems we will encounter will 
often fall into one of three special cases, namely, where F is a function of t only, or 
x only, or v only. In all of these cases, we must invoke the given initial conditions, 
xo = x(to) and no = v(to), to obtain our final solutions. These initial conditions will 
appear in the limits of the integrals in the following discussion. 9 

Note: You may want to just skim the following page and a half, and then refer 
back to it as needed. Don’t try to memorize all the different steps. We present 
them only for completeness. The whole point here can basically be summarized by 
saying that sometimes you want to write x as dv/dt, and sometimes you want to 
write it as vdv/dx (see eq. (2.19)). Then you “simply” have to separate variables 
and integrate. We’ll go through the three special cases, and then we’ll do some 
examples. 

8 It can always be solved for x(t') numerically, to any desired accuracy. This is discussed in 
Appendix D. 

9 It is no coincidence that we need two initial conditions to completely specify the solution to 
our second -order F = mx differential equation. It is a general result (which we’ll just accept here) 
that the solution to an nth-order differential equation has n free parameters, which must then be 
determined from the initial conditions. 
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• F is a function oft only: F = F(t). 

Since a = d 2 x/dt 2 , we just need to integrate F = ma twice to obtain x(t). 
Let’s do this in a very systematic way, to get used to the general procedure. 
First, write F = ma as 

mf t = F(t). (2.16) 

Then separate variables and integrate both sides to obtain 10 

m J dv' — j F(t') dt'. (2-17) 

We have put primes on the integration variables so that we don’t confuse them 
with the limits of integration. Eq. (2.17) yields v as a function of t, v(t). We 
then separate variables in dx/dt = v(t ) and integrate to obtain 


f%m , r l , , 

/ dx = v(t ) dt. 
Jx 0 Jtn 


(2.18) 


This yields i as a function of t, x(t). This procedure might seem like a 
cumbersome way to simply integrate something twice. That’s because it is. 
But the technique proves more useful in the following case. 


• F is a function of x only: F = F(x). 


We will use 


dv _ dx dv _ dv 
dt dt dx dx 


to write F = ma as 


dv rV \ 

mv . = b lx), 
dx 


Now separate variables and integrate both sides to obtain 


(2.19) 

( 2 . 20 ) 


/ V(x) rx 

v dv = / F{x)dx. 


( 2 . 21 ) 


The left side will contain the square of v(x). Taking a square root, this gives 
v as a function of x, v(x). Separate variables in dx/dt = v(x) to obtain 


r x (t) dx 1 _ f* 

L ^) = L 


( 2 . 22 ) 


This gives t as a function of x. and hence (in principle) x as a function of t. 
x(t). The unfortunate thing about this case is that the integral in eq. (2.22) 
might not be doable. And even if it is, it might not be possible to invert t(x) 
to produce x(t). 

10 If you haven’t seen such a thing before, the act of multiplying both sides by the infinitesimal 
quantity dt' might make you feel a bit uneasy. But it is in fact quite legal. If you wish, you 
can imagine working with the small (but not infinitesimal) quantities Av and At, for which it is 
certainly legal to multiply both sides by At. Then you can take a discrete sum over many At 
intervals, and then finally take the limit At —> 0, which results in eq. (2.17) 
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> F is a function of v only: F = F(v). 
Write F = ma as 




(2.23) 


Separate variables and integrate both sides to obtain 

/ v (f) dv' H 

, IS = / dt'. (2.24) 

o FW) Jt 0 K * 

This yields 1 as a function of v. and hence (in principle) v as a function of t, 
v(t). Integrate dx/dt = v(t ) to obtain x(t) from 


/ x(t) ft 

dx = J v(t) d 


(2.25) 


Note: In this F = F(v) case, if you want to find v as a function of x, v(x). 
then you should write a as v(dv/dx) and integrate 


m 


i: 


v' dv' 

W) 



(2.26) 


You can then obtain x(t) from eq. (2.22), if desired. 


When dealing with the initial conditions, we have chosen to put them in the 
limits of integration above. If you wish, you can perform the integrals without any 
limits, and just tack on a constant of integration to your result. The constant is 
then determined from the initial conditions. 

Again, as mentioned above, you do not have to memorize the above three proce¬ 
dures, because there are variations, depending on what you’re given and what you 
want to solve for. All you have to remember is that x can be written as either dv/dt 
or v dv/dx. One of these will get the job done (namely, the one that makes only two 
out of the three variables, t, x, and v. appear in your differential equation). And 
then be prepared to separate variables and integrate as many times as needed. 


a is dv by dt. 

Is this useful? There’s no guarantee. 

If it leads to “Oh, heck!” ’s, 

Take dv by dx, 

And then write down its product with v. 


Example 1 (Gravitational force): A particle of mass m is subject to a constant 
force F = —mg. The particle starts at rest at height h. Because this constant force 
falls into all of the above three categories, we should be able to solve for y{t) in two 
ways: 

(a) Find y(t) by writing a as dv/dt. 

(b) Find y(t) by writing a as vdv/dy. 
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Solution: 

(a) F = ma gives dv/dt = —g. Integrating this yields v = —gt + C, where C is a 
constant of integration. 11 The initial condition u(0) = 0 gives (7 = 0. Therefore, 
dy/dt = —gt. Integrating this and using y(0) = h gives 

y = h-\gt 2 . (2.27) 

(b) F = ma gives vdv/dy = —g. Separating variables and integrating yields v 2 /2 = 
—gy + C. The i nitial cond ition v(Q) = 0 gives v' 2 /2 = —gy + gh. Therefore, 
v = dy/dt = —y/2g(h — y). We have chosen the negative square root, because 
the particle is falling. Separating variables gives 

j j dt. (2.28) 

This yields 2 \/h — y = \/2gt, where we have used the initial condition y(Q) = h. 
Hence, y = h — gt 2 /2, in agreement with part (a). The solution in part (a) was 
clearly the simpler one. 


Example 2 (Dropped ball): A beach-ball is dropped from rest at height h. 
Assume 12 that the drag force from the air takes the form, F,j = —/3v. Find the 
velocity and height as a function of time. 


Solution: For simplicity in future formulas, let’s write the drag force as F d = —0v 
■ mar (so we won’t have a bunch of 1/m’s floating around). Taking upward to be 
the positive y direction, the force on the ball is 

F = —mg — mav. (2.29) 


Note that v is negative here, because the ball is falling, so the drag force points 
upward, as it should. Writing F = m dv/dt, and separating variables, gives 


r (,) dv' __ r l 
J 0 g + av' J 0 1 


Integration yields ln(l + av/g) = —at. Exponentiation then gives 


(2.30) 


«W = -|( l-e“ at ). 


(2.31) 


Writing dy/dt = v(t), and then separating variables and integrating to obtain y(t), 
yields 


sr^-ifs—) 


(2.32) 


11 We’ll do this example by adding on constants of integration which are then determined from 
the initial conditions. We’ll do the following example by putting the initial conditions in the limits 
of integration. 

12 The drag force is roughly proportional to v as long as the speed is fairly slow. For large speeds, 
the drag force is roughly proportional to v 2 . 
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Therefore, 

y (t) = . 


(2.33) 


(a) Let’s look at some limiting cases. If t is very small (more precisely, if at <C 1), then 
we can use e~ x w 1 — x + x 2 /2 to make approximations to leading order in t. You 
can show that eq. (2.31) gives v(t) ss —gt. This makes sense, because the drag force 
is negligible at the start, so the ball is essentially in free fall. And eq. (2.33) gives 
y(t) wk - gt 2 / 2, as expected. 

We can also look at large t. In this case, e~ at is essentially equal to zero, so eq. (2.31) 
gives v(t) « —g/a. (This is the “terminal velocity.” Its value makes sense, because it 
is the velocity for which the total force, —mg — mav, vanishes.) And eq. (2.33) gives 
y(t) « h — ( g/a)t + g/a 2 . Interestingly, we see that for large t, g/a 2 is the distance 
our ball lags behind another ball which started out already at the terminal velocity, 
g/a. 

(b) The velocity of the ball obtained in eq. (2.31) depends on a , which was defined via 
Fa = —mav. We explicitly wrote the m here just to make all of our formulas look a 
little nicer, but it should not be inferred that the velocity of the ball is independent of 
m. The coefficient /3 = ma depends (in some complicated way) on the cross-sectional 
area, A, of the ball. Therefore, a oc A/m. Two balls of the same size, one made of 
lead and one made of styrofoam, will have the same A but different m’s. Hence, their 
ck’s will be different, and they will fall at different rates. 

For heavy objects in a thin medium such as air, a is small, so the drag effects are not 
very noticeable over short distances. Heavy objects fall at roughly the same rate. If 
the air were a bit thicker, different objects would fall at noticeably different rates, and 
maybe it would have taken Galileo a bit longer to come to his conclusions. 

What would you have thought, Galileo, 

If instead you dropped cows and did say, “Oh! 

To lessen the sound 

Of the moos from the ground, 

They should fall not through air, but through mayo!” £ 


2.4 Projectile motion 

Consider a ball thrown through the air, not necessarily vertically. We will neglect 
air resistance in the following discussion. 

Let x and y be the horizontal and vertical positions, respectively. The force in 
the x-direction is F x = 0, and the force in the y-direction is F y = —mg. So F = ma 
gives 

x = 0, and y = —g. (2.34) 

Note that these two equations are “decoupled.” That is, there is no mention of y 
in the equation for x, and vice-versa. The motions in the x- and y-directions are 
therefore completely independent. 

Remark: The classic demonstration of the independence of the x- and y-rnotions is the 
following. Fire a bullet horizontally (or, preferably, just imagine firing a bullet horizontally), 
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and at the same time drop a bullet from the height of the gun. Which bullet will hit the 
ground first? (Neglect air resistance, the curvature of the earth, etc.) The answer is that 
they will hit the ground at the same time, because the effect of gravity on the two y-motions 
is exactly the same, independent of what is going on in the ^-direction, f 

If the initial position and velocity are (X,Y) and (V x . V y ). then we can easily 
integrate eqs. (2.34) to obtain 


x(t) = V x , 

&(t) = Vy- gt. (2.35) 

Integrating again gives 

x(t) = X + V x t, 

y (t) = Y + V y t- * gt 2 . (2.36) 

These equations for the speeds and positions are all you need to solve a projectile 
problem. 


Example (Throwing a ball): 

(a) For a given initial speed, at what inclination angle should a ball be thrown so 
that it travels the maximum horizontal distance by the time it returns to the 
ground? Assume that the ground is horizontal, and that the ball is released 
from ground level. 

(b) What is the optimal angle if the ground is sloped upward at an angle /3 (or 
downward, if fi is negative)? 

Solution: 

(a) Let the inclination angle be 6, and let the initial speed be V. Then the horizontal 
speed is always V x = V cos 6, and the initial vertical speed is V y = V sin 9. 

The first thing we need to do is find the time t in the air. We know that the 
vertical speed is zero at time t/ 2, because the ball is moving horizontally at 
its highest point. So the second of eqs. (2.35) gives V y = g(t/ 2). Therefore, 
t = 2 Vy/g . 13 

The first of eqs. (2.36) tells us that the horizontal distance traveled is d = V x t. 
Using t = 2 V y /g in this gives 

. 2 V x Vy U 2 (2sin0cos0) U 2 sin20 _ 

a = -= -= -. (2.37) 

g g g 

The sin 26 factor has a maximum at 

0 = J . (2.38) 

13 Alternatively, the time of flight can be found from the second of eqs. (2.36), which says that 

the ball returns to the ground when V y t s= gt 2 / 2. We will have to use this type of strategy in part 

(b), where the trajectory is not symmetric around the maximum. 
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The maximum horizontal distance traveled is then d rnax = V 2 /g. 

Remarks: For 9 = 7r/4, you can show that the maximum height achieved is V 2 /Ag. 
This may be compared to the maximum height of V 2 /2g (as you can show) if the ball 
is thrown straight up. Note that any possible distance you might want to find in this 
problem must be proportional to V 2 /g, by dimensional analysis. The only question is 
what the numerical factor is. Jl» 

(b) As in part (a), the first thing we need to do is find the time t in the air. If the 
ground is sloped at an angle (3, then the equation for the line of the ground is 

y = (tan/?)®. (2.39) 

The path of the ball is given in terms of t by 

x = (Vcos9)t, and y = (V sin 9)t — ^gt 2 . (2.40) 


We must solve for the t that makes y = (tan/?)®, because this gives the place 
where the path of the ball intersects the line of the ground. Using eqs. (2.40), 
we find that y = (tan f3)x when 


(There is, of course, also the solution t = 0.) Plugging this into the expression 
for x in eq. (2.40) gives 


= -(sin 0 cos 0 — tan (3 cos 2 6). 


We must now maximize this value for x, which is equivalent to maximizing 
the distance along the slope. Setting the derivative with respect to 9 equal to 
zero, and using the double-angle formulas, sin 29 = 2 sin 9 cos 9 and cos 2 9 = 
cos 2 9 — sin 2 #, we find tan/? = —cot 29. This can be rewritten as tan (3 = 
- tan(7r/2 - 29). Therefore, (3 = — (tt/2 - 29), so we have 


In other words, the throwing angle should bisect the angle between the ground 
and the vertical. 


Remarks: For /? « tt/2, we have 0 rs tt/2, as should be the case. For j3 = 0, we have 
9 = 7t/4, as we found in part (a). And for /? « —tt/2, we have 6 « 0, which makes 


Substituting the value of 9 from eq. (2.43) into eq. (2.42), you can show (after a bit 
of algebra) that the maximum distance traveled along the tilted ground is 


This checks in the various limits for /?. £ 
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Along with the bullet example mentioned above, another classic example of the 
independence of the x- and ^/-motions is the “hunter and monkey” problem. In it, 
a hunter aims an arrow (made of styrofoam, of course) at a monkey hanging from 
a branch in a tree. The monkey, thinking he’s being clever, tries to avoid the arrow 
by letting go of the branch right when he sees the arrow released. The unfortunate 
consequence of this action is that he will get hit, because gravity acts on both him 
and the arrow in the same way; they both fall the same distance relative to where 
they would have been if there were no gravity. And the monkey would get hit in 
such a case, because the arrow is initially aimed at him. You can work this out in 
Exercise 16, in a more peaceful setting involving fruit. 

If a monkey lets go of a tree, 

The arrow will hit him, you see, 

Because both heights are pared 
By a half gt 2 

From what they would be with no g. 

2.5 Motion in a plane, polar coordinates 

When dealing with problems where the motion lies in a plane, it is often conve¬ 
nient to work with polar coordinates, r and 6. These are related to the Cartesian 
coordinates by (see Fig. 2.6) 

x = rcos9, and y = rsm6. (2-45) 

Depending on the problem, either Cartesian or polar coordinates will be easier to 
use. It is usually clear from the setup which is better. For example, if the problem 
involves circular motion, then polar coordinates are a good bet. But to use polar 
coordinates, we need to know what form Newton’s second law takes in terms of 
them. Therefore, the goal of the present section is to determine what F = ma = mi 
looks like when written in terms of polar coordinates. 

At a given position r in the plane, the basis vectors in polar coordinates are r, 
which is a unit vector pointing in the radial direction; and 9 , which is a unit vector 
pointing in the counterclockwise tangential direction. In polar coords, a general 
vector may therefore be written as 


r = rr. (2.46) 

Note that the directions of the r and 9 basis vectors depend, of course, on r. 

Since the goal of this section is to find r, we must, in view of eq. (2.46), get a 
handle on the time derivative of r. And we’ll eventually need the derivative of 9 , 
too. In contrast with the fixed Cartesian basis vectors (x and y), the polar basis 
vectors (r and 9 ) change as a point moves around in the plane. 

We can find f and 9 in the following way. In terms of the Cartesian basis, 
Fig. 2.7 shows that 
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r = cos 9 x + sin 9 y, 

6 = — sin 9 x + cos 9 y . (2.47) 

Taking the time derivative of these equations gives 

r = — sin 9 Qit + cos 0 Oy, 

6 = — cos 9 0x — sin 6 Oy . (2.48) 

Using eqs. (2.47), we arrive at the nice clean expressions, 

f = 90, and 0 = -Or. (2.49) 

These relations are fairly evident if we look at what happens to the basis vectors as 
r moves a tiny distance in the tangential direction. Note that the basis vectors do 
not change as r moves in the radial direction. 

We can now start differentiating eq. (2.46). One derivative gives (yes, the 
product rule works fine here) 

r = rr + rr 

= rr + rdd. (2.50) 

This makes sense, because f is the speed in the radial direction, and rd is the speed 
in the tangential direction, which is often written as wr (where uj = 9 is the angular 
speed, or “angular frequency”). 14 

Differentiating eq. (2.50) then gives 

r = rr + ri + rdd + rdd + rdd 

= rr + r(9d) + rOO + rOO + r9(-9r ) 

= {r-rd 2 )r + (r0 + 2rd)d. (2.51) 

Finally, equating mr with F = F r r + Fed gives the radial and tangential forces as 
F r = m(r — r9 2 ), 

F e = m(r0 + 2rd). (2.52) 

(See Exercise 32 for a slightly different derivation of these equations.) Let’s look at 
each of the four terms on the right-hand sides of eqs. (2.52). 

• The mr term is quite intuitive. For radial motion, it simply states that F = ma 
along the radial direction. 

• The mrO term is also quite intuitive. For circular motion, it states that F = 
ma along the tangential direction, because rO is the second derivative of the 
distance rO along the circumference. 

14 For rO to be the tangential speed, we must measure 9 in radians and not degrees. Then rO is 
by definition the distance along the circumference, so r6 is the speed along the circumference. 
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• The —mrO 2 term is also fairly clear. For circular motion, it says that the radial 
force is —m{rO) 2 /r = —mv 2 /r, which is the familiar force that causes the 
centripetal acceleration, v 2 /r. See Problem 19 for an alternate (and quicker) 
derivation of this v 2 /r result. 

• The 2 mf6 term isn’t so obvious. It is called the Coriolis force. There are 
various ways to look at this term. One is that it exists in order to keep angular 
momentum conserved. We’ll have a great deal to say about the Coriolis force 
in Chapter 9. 


Example (Circular pendulum): A mass hangs from a massless string of length 
I. Conditions have been set up so that the mass swings around in a horizontal circle, 
with the string making an angle 0 with the vertical (see Fig. 2.8). What is the 
angular frequency, w, of this motion? 

Solution: The mass travels in a circle, so the horizontal radial force must be 
F r = mr() 2 = mrw 2 (with r = i sin 0 ), directed radially inward. The forces on 
the mass are the tension in the string, T, and gravity, mg (see Fig. 2.9). There is no 
acceleration in the vertical direction, so F = ma in the vertical and radial directions 
gives, respectively, 


T cos 0 = mg, 

T sin/? = m(lsm.0)J 2 . (2.53) 


Solving for u> gives 


(2.54) 


Note that if 0 ss 0, then cu « yfgfl, which equals the frequency of a plane pendulum 
of length t. And if p « 90°, then u> oo, which makes sense. 



Figure 2.8 



mg 

Figure 2.9 
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6. Three-mass Atwood’s ** 

Consider the Atwood’s machine in Fig. 2.15, with masses m, 2m, and 3m. 
Find the accelerations of all three masses. 

7. Accelerating plane ** 

A block of mass m rests on a plane inclined at angle 9. The coefficient of static 
friction between the block and the plane is fi. The plane is accelerated to the 
right with acceleration a (which may be negative); see Fig. 2.16. For what 
range of a does the block remain at rest with respect to the plane? 

8. Accelerating cylinders ** 

Three identical cylinders are arranged in a triangle as shown in Fig. 2.17, 
with the bottom two lying on the ground. The ground and the cylinders are 
frictionless. You apply a constant horizontal force (directed to the right) on 
the left cylinder. Let a be the acceleration you give to the system. For what 
range of a will all three cylinders remain in contact with each other? 

Section 2.3: Solving differential equations 

9. —bv 2 force * 

A particle of mass m is subject to a force F(v') = —bv 2 . The initial position 
is zero, and the initial speed is vq. Find x(i). 

10. — kx force ** 

A particle of mass m is subject to a force F(x) = —kx. The initial position is 
zero, and the initial speed is vq. Find x(t). 




Figure 2.15 



Figure 2.16 


A 

Figure 2.17 


11. kx force ** 

A particle of mass m is subject to a force F(x) = kx. The initial position is 
zero, and the initial speed is vo- Find x(t). 

12. Motorcycle circle *** 

A motorcyclist wishes to travel in a circle of radius R on level ground. The 
coefficient of friction between the tires and the ground is n. The motorcycle 
starts at rest. What is the minimum distance the motorcycle must travel in 
order to achieve its maximum allowable speed (that is, the speed above which 
it will skid out of the circular path)? 


Section 2-4: Projectile motion 

13. Dropped balls 

A ball is dropped from height Ah. After it has fallen a distance d, a second 
ball is dropped from height h. What should d be (in terms of h ) so that the 
balls hit the ground at the same time? 


*-Q 
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14. Equal distances 

At what angle should a ball be thrown so that its maximum height equals the 
horizontal distance traveled? 

15. Redirected horizontal motion * 

A ball is dropped from rest at height h, and it bounces off a surface at height y. 
with no loss in speed. The surface is inclined at 45°, so that the ball bounces 
off horizontally. What should y be so that the ball travels the maximum 
horizontal distance? 



Figure 2.18 



Figure 2.19 


d 


Figure 2.20 


16. Newton’s apple * 

Newton is tired of apples falling on his head, so he decides to throw a rock 
at one of the larger and more formidable looking apples positioned directly 
above his favorite sitting spot. Forgetting all about his work on gravitation, 
he aims the rock directly at the apple (see Fig. 2.18). To his surprise, the 
apple falls from the tree just as he releases the rock. Show, by calculating 
the rock’s height when it reaches the horizontal position of the apple, that the 
rock will hit the apple. 15 

17. Throwing at a wall * 

You throw a ball with speed Vo at a vertical wall, a distance I away. At what 
angle should you throw the ball, so that it hits the wall at a maximum height? 
Assume t <Vq jg (why?). 

18. Firing a cannon ** 

A cannon, when aimed vertically, is observed to fire a ball to a maximum 
height of L. Another ball is then fired with this same speed, but with the 
cannon now aimed up along a plane of length L, inclined at an angle 0, as 
shown in Fig. 2.19. What should 6 be, so that the ball travels the largest 
horizontal distance, d, by the time it returns to the height of the top of the 
plane? 

19. Colliding projectiles * 

Two balls are fired from ground level, a distance d apart. The right one is 
fired vertically with speed V; see Fig. 2.20. You wish to simultaneously fire 
the left one at the appropriate velocity u so that it collides with the right ball 
when they reach their highest point. What should u be (give the horizontal 
and vertical components)? Given d, what should V be so that the speed u is 
minimum? 


ls This problem suggests a way in which William Tell and his son might survive their ordeal if 
they were plopped down on a planet with an unknown gravitational constant (provided that the 
son weren’t too short or g weren’t too big). 
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20. Throwing in the wind * 

A ball is thrown horizontally to the right, from the top of a vertical cliff of 
height h. A wind blows horizontally to the left, and assume (simplistically) 
that the effect of the wind is to provide a constant force to the left, equal in 
magnitude to the weight of the ball. How fast should the ball be thrown, so 
that it lands at the foot of the cliff? 

21. Throwing in the wind again * 

A ball is thrown eastward across level ground. A wind blows horizontally to 
the east, and assume (simplistically) that the effect of the wind is to provide 
a constant force to the east, equal in magnitude to the weight of the ball. 
At what angle 6 should the ball be thrown, so that it travels the maximum 
horizontal distance? 

22. Increasing gravity * 

At t = 0 on the planet Gravitus Increasicus, a projectile is fired with speed Vo 
at an angle 6 above the horizontal. This planet is a strange one, in that the 
acceleration due to gravity increases linearly with time, starting with a value 
of zero when the projectile is fired. In other words, g(t) = /3t , where P is a 
given constant. What horizontal distance does the projectile travel? What 
should 9 be so that this horizonal distance is maximum? 

23. Cart, ball, and plane ** 

A cart rolls down an inclined plane. A ball is fired from the cart, perpendic¬ 
ularly to the plane. Will the ball eventually land in the cart? Hint: Choose 
your coordinate system wisely. 

Section 2.5: Motion in a plane, polar coordinates 

24. Low-orbit satellite 

What is the speed of a satellite whose orbit is just above the earth’s surface? 
Give the numerical value. 

25. Weight at the equator * 

A person stands on a scale at the equator. If the earth somehow stopped 
spinning but kept its same shape, would the reading on the scale increase or 
decrease? By what fraction? 

26. Banking an airplane * 

An airplane flies at speed v in a horizontal circle of radius R. At what angle 
should the plane be banked so that you don’t feel like you are getting flung to 
the side in your seat? 

27. Car on a banked track ** 

A car travels around a circular banked track with radius R. The coefficient of 
friction between the tires and the track is /x. What is the maximum allowable 
speed, above which the car slips? 



side point 



Figure 2.21 


11-22 CHAPTER 2. USING F = MA 

28. Driving on tilted ground ** 

A driver encounters a large tilted parking lot, where the angle of the ground 
with respect to the horizontal is 9. The driver wishes to drive in a circle of 
radius R, at constant speed. The coefficient of friction between the tires and 
the ground is n. 

(a) What is the largest speed the driver can have if he wants to avoid slipping? 

(b) What is the largest speed the driver can have, assuming he is concerned 
only with whether or not he slips at one of the “side” points on the circle 
(that is, halfway between the top and bottom points; see Fig. 2.21)? 

29. Rolling wheel * 

If you paint a dot on the rim of a rolling wheel, the coordinates of the dot may 
be written as 16 

(x, y) = (R9 + Rstn.9, R + Rcos9). (2.55) 

The path of the dot is called a cycloid. Assume that the wheel is rolling at 
constant speed, which implies 9 = ut. 

(a) Find v(t) and a(t) of the dot. 

(b) At the instant the dot is at the top of the wheel, it may be considered to 
be moving along the arc of a circle. What is the radius of this circle in 
terms of R1 Hint You know v and a. 

30. Bead on a hoop ** 

A bead rests on top of a frictionless hoop of radius R which lies in a vertical 
plane. The bead is given a tiny push so that it slides down and around the 
hoop. At what points on the hoop (specify them by giving the angular position 
relative to the top) is the bead’s acceleration vertical? 17 What is this vertical 
acceleration? Note: We haven’t studied conservation of energy yet, but use the 
fact that the bead’s speed after it has fallen a height h is given by v = \/2gh. 

31. Another bead on a hoop ** 

A bead rests on top of a frictionless hoop of radius R which lies in a vertical 
plane. The bead is given a tiny push so that it slides down and around 
the hoop. At what points on the hoop (specify them by giving the angular 
position relative to the horizontal) is the bead’s acceleration horizontal? As 
in the previous exercise, use v = y/2gh. 


16 This can be shown by writing (x,y) as ( R0,R ) + (RsinO, Rcos0). The first term here is the 
position of the center of the wheel, and the second term is the position of the dot relative to the 
center, where 6 is measured clockwise from the top. 

17 One such point is the bottom of the hoop. Another point is technically the top, where a = 0. 
Find the other two more interesting points (one on each side). 
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32. Derivation of F r and Fg ** 

In Cartesian coordinates, a general vector takes the form, 

r = xx + yy 

= r cos 0 x + r sin 9 y. (2.56) 

Derive eqs. (2.52) by taking two derivatives of this expression for r, and then 
using eqs. (2.47) to show that the result can be written in the form of eq. 
(2.51). Note that unlike r and 6. the vectors x and y do not change with 
time. 

33. A force Fg = 2 r6 ** 

Consider a particle that feels an angular force only, of the form Fg = 2 mfO. 
(As in Problem 21, there’s nothing all that physical about this force; it simply 
makes the F = ma equations solvable.) Show that the trajectory takes the 
form of an exponential spiral, that is, r = Ae d . 

34. A force Fg = 3 ff) ** 

Consider a particle that feels an angular force only, of the form Fg = SmrO. 
(As in the previous exercise, we’re solving this problem simply because we 
can.) Show that f = B. Also, show that the particle reaches r = oo 

in a finite time. 
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2.7 Problems 



Figure 2.22 



m l m 2 

Figure 2.23 



Figure 2.24 



Section 2.2: Free-body diagrams 

1. Sliding down a plane ** 

(a) A block starts at rest and slides down a frictionless plane inclined at angle 
9. What should 9 be so that the block travels a given horizontal distance 
in the minimum amount of time? 

(b) Same question, but now let there be a coefficient of kinetic friction, /j, 
between the block and the plane. 

2. Moving plane *** 

A block of mass m is held motionless on a frictionless plane of mass M and 
angle of inclination 9 (see Fig. 2.22). The plane rests on a frictionless hori¬ 
zontal surface. The block is released. What is the horizontal acceleration of 
the plane? 

3. Sliding sideways on plane *** 

A block is placed on a plane inclined at angle 9. The coefficient of friction 
between the block and the plane is // = tan 9. The block is given a kick so 
that it initially moves with speed V horizontally along the plane (that is, in 
the direction perpendicular to the direction pointing straight down the plane). 
What is the speed of the block after a very long time? 

4. Atwood’s machine 

A massless pulley hangs from a fixed support. A massless string connecting 
two masses, m\ and m 2 , hangs over the pulley (see Fig. 2.23). Find the 
acceleration of the masses and the tension in the string. 

5. Double Atwood’s machine ** 

A double Atwood’s machine is shown in Fig. 2.24, with masses mi, m 2 , and 
m 3 . What are the accelerations of the masses? 

6. Infinite Atwood’s machine *** 

Consider the infinite Atwood’s machine shown in Fig. 2.25. A string passes 
over each pulley, with one end attached to a mass and the other end attached to 
another pulley. All the masses are equal to m, and all the pulleys and strings 
are massless. The masses are held fixed and then simultaneously released. 
What is the acceleration of the top mass ? 18 


18 You may define this infinite system as follows. Consider it to be made of N pulleys, with a 
non-zero mass replacing what would have been the (N+ l)st pulley. Then take the limit as N —> 00 . 


Figure 2.25 
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7. Line of pulleys * 

N + 2 equal masses hang from a system of pulleys, as shown in Fig. 2.26. 
What are the accelerations of all the masses? 

8. Ring of pulleys ** 

Consider the system of pulleys shown in Fig. 2.27. The string (which is a 
loop with no ends) hangs over N fixed pulleys. N masses, m\, m2, ■ ■., rri n , 
are attached to N pulleys that hang on the string. What are the accelerations 
of all the masses? 

Section 2.3: Solving differential equations 

9. Exponential force 

A particle of mass m is subject to a force F(t) = me~ bt . The initial position 
and speed are zero. Find x(t). 

10. Falling chain ** 

A chain of length l is held stretched out on a frictionless horizontal table, with 
a length yo hanging down through a hole in the table. The chain is released. 
As a function of time, find the length that hangs down through the hole (don’t 
bother with t after the chain loses contact with the table). Also, find the speed 
of the chain right when it loses contact with the table. 

11. Circling around a pole ** 

A mass, which is free to move on a horizontal frictionless plane, is attached to 
one end of a massless string which wraps partially around a frictionless vertical 
pole of radius r (see the top view in Fig. 2.28). You hold onto the other end 
of the string. At t = 0, the mass has speed no in the tangential direction along 
the dotted circle of radius R shown. 

Your task is to pull on the string so that the mass keeps moving along the 
dotted circle. You are required to do this in such a way that the string remains 
in contact with the pole at all times. (You will have to move your hand around 
the pole, of course.) What is the the speed of the mass as a function of time? 

12. Throwing a beach ball *** 

A beach ball is thrown upward with initial speed no- Assume that the drag 
force from the air is F = —mav. What is the speed of the ball, Vf. when 
it hits the ground? (An implicit equation is sufficient.) Does the ball spend 
more time or less time in the air than it would if it were thrown in vacuum? 

13. Balancing a pencil *** 

Consider a pencil that stands upright on its tip and then falls over. Let’s 
idealize the pencil as a mass m sitting at the end of a massless rod of length 
A 19 

19 It actually involves only a trivial modification to do the problem correctly using the moment of 
inertia and the torque. But the point-mass version will be quite sufficient for the present purposes. 


N= 3 

Figure 2.26 



m N m 1 m 2 

Figure 2.27 



Figure 2.28 
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(a) Assume that the pencil makes an initial (small) angle 9q with the vertical, 
and that its initial angular speed is uq. The angle will eventually become 
large, but while it is small (so that sin0 ~ 9), what is 9 as a function of 
time? 

(b) You might think that it would be possible (theoretically, at least) to make 
the pencil balance for an arbitrarily long time, by making the initial 9o 
and ujq sufficiently small. 

However, it turns out that due to Heisenberg’s uncertainty principle 
(which puts a constraint on how well we can know the position and mo¬ 
mentum of a particle), it is impossible to balance the pencil for more than 
a certain amount of time. The point is that you can’t be sure that the 
pencil is initially both at the top and at rest. The goal of this problem 
is to be quantitative about this. The time limit is sure to surprise you. 
Without getting into quantum mechanics, let’s just say that the uncer¬ 
tainty principle says (up to factors of order 1) that AxAp > ft (where 
ft = 1.06 • 10 -34 Js is Planck’s constant). The implications of this are 
somewhat vague, but we’ll just take it to mean that the initial conditions 
satisfy (£9o)(m£u>o) > ft. 

With this condition, find the maximum time it can take your solution 
in part (a) to become of order 1. In other words, determine (roughly) 
the maximum time the pencil can balance. Assume m = 0.01 kg, and 
f = 0.1m. 

Section 2.j: Projectile motion 

14. Throwing a ball from a cliff ** 

A ball is thrown with speed v from the edge of a cliff of height ft. At what 
inclination angle should it be thrown so that it travels the maximum horizontal 
distance? What is this maximum distance? Assume that the ground below 
the cliff is horizontal. 

15. Redirected motion ** 

A ball is dropped from rest at height ft, and it bounces off a surface at height 
y (with no loss in speed). The surface is inclined so that the ball bounces off 
at an angle of 9 with respect to the horizontal. What should y and 9 be so 
that the ball travels the maximum horizontal distance? 

16. Maximum trajectory length *** 

A ball is thrown at speed v from zero height on level ground. Let 9q be the 
angle at which the ball should be thrown so that the distance traveled through 
the air is maximum. Show that 9q satisfies 

sm<Mn(Y^)=l. (2.57) 

V cos 0 O / 

You can show numerically that 9q ~ 56.5°. 
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17. Maximum trajectory area * 

A ball is thrown at speed v from zero height on level ground. At what angle 
should it be thrown so that the area under the trajectory is maximum? 

18. Bouncing ball * 

A ball is thrown straight upward so that it reaches a height h. It falls down 
and bounces repeatedly. After each bounce, it returns to a certain fraction 
/ of its previous height. Find the total distance traveled, and also the total 
time, before it comes to rest. What is its average speed? 

Section 2.5: Motion in a plane, polar coordinates 

19. Centripetal acceleration * 

Show that the acceleration of a particle moving in a circle is v 2 /r. To do this, 
draw the position and velocity vectors at two nearby times, and then make 
use of some similar triangles. 

20. Free particle ** 

Consider a free particle in a plane. Using Cartesian coordinates, it is trivial to 
show that the particle moves in a straight line. The task of this problem is to 
demonstrate this result in a much more cumbersome way, using eqs. (2.52). 
More precisely, show that cos 6 = rg/r for a free particle, where ro is the radius 
at closest approach to the origin, and 9 is measured with respect to this radius. 

21. A force Fg = rO ** 

Consider a particle that feels an angular force only, of the form Fg = mrO. 
(There’s nothing all that physical about this force. It simply makes the F = 
ma equations solvable.) Show that r = •</Ain r + where A and B are 
constants of integration, determined by the initial conditions. 
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2.8 Solutions 

1. Sliding down a plane 

(a) The component of gravity along the plane in gsin#. The acceleration in the 
horizontal direction is therefore a x = (g sin 9) cos 9. Our goal is to maximize a x . 
By taking the derivative, or by noting that sin 9 cos 9 = (sin20)/2, we obtain 
9 = 7t/4. 

(b) The normal force from the plane is mg cos 9, so the kinetic friction force is 
gmg cos 9. The acceleration along the plane is therefore g(sin 9 — g cos 9) , and 
so the acceleration in the horizontal direction is a x = 5(sin 9 — g cos 6) cos 9. We 
want to maximize this. Setting the derivative equal to zero gives 

(cos 2 9 — sin 2 9) + 2/zsin0cos0 = 0 => cos 26 + g sin 29 = 0 

=> tan 20 = — — . (2.58) 

it 

For g —> 0, this gives the 7 t /4 result from part (a). For g —* 00, we obtain 
9 Rj 7 t/ 2, which makes sense. 

Remark: The time to travel a horizontal distance d is obtained from a x t 2 / 2 = d. 
In part (a), this gives a minimum time of 2 \Jdfg. In part (b), you can show that 
the maximum a x is (g/ 2)(y/l + g 2 — g), and that this leads to a minimum time of 
2y/ d/g\J + g? + g. This has the correct g —> 0 limit, and it behaves like 2^/2 gd/g 
for g —> 00 . ft 

2. Moving plane 

Let N be the normal force between the block and the plane. Note that we cannot 
assume that N = mg cos 9, because the plane recoils. We can see that N = mg cos 9 
is in fact incorrect, because in the limiting case where M = 0, we have no normal 
force at all. 

The various F = ma equations (vertical and horizontal for the block, and horizontal 
for the plane) are 

mg — N cos 9 = ma y , 

Nsin9 = ma x , 

N sin 9 = MA X , (2.59) 

where we have chosen the positive directions for a y , a x , and A x to be downward, 
rightward, and leftward, respectively. There are four unknowns here: a x , a y , A x , and 
N. So we need one more equation. This fourth equation is the constraint that the 
block remains in contact with the plane. The horizontal distance between the block 
and its starting point on the plane is (a x +A x )t 2 /2, and the vertical distance is a y t 2 / 2. 
The ratio of these distances must equal tan 9 if the block is to remain on the plane. 
Therefore, we must have 

^ A = t an ^- (2.60) 

Using eqs. (2.59), this becomes 
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(In the limit M —> oo, this reduces to N = mg cos 9, as it should.) Having found N, 
the third of eqs. (2.59) gives A x , which may be written as 


N sin 9 mg sin 9 cos 9 
M M + m sin 2 9 ' 


(2.62) 


Remarks: For given M and m, you can show that the angle Oo that maximizes A x is 



If M < m, then Oo « 0. If M » m, then Oo » 7 t/4 . 

In the limit M -C m, eq. (2.62) gives A x as <7/tan#. This makes sense, because m falls 
essentially straight down, and the plane gets squeezed out to the left. 

In the limit M > m, we have A x as g(m/M) sin 0 cos 0. This is more transparent if we instead 
look at a x = ( M/m)A x as g sin# cos#. Since the plane is essentially at rest in this limit, this 
value of a x implies that the acceleration of m along the plane is equal to a x / cos# as <7sin#, 
as expected. A 

3. Sliding sideways on plane 

The normal force from the plane is N = mg cos 9. Therefore, the friction force on 
the block is gN = (tan 9)N = mg sin 0. This force acts in the direction opposite to 
the motion. The block also feels the gravitational force of mg sin 9 pointing down the 
plane. 

Because the magnitudes of the friction force and the gravitational force along the 
plane are equal, the acceleration along the direction of motion equals the negative 
of the acceleration in the direction down the plane. Therefore, in a small increment 
of time, the speed that the block loses along its direction of motion exactly equals 
the speed that it gains in the direction down the plane. Letting v be the speed of 
the block, and letting v y be the component of the velocity in the direction down the 
plane, we therefore have 

v + v y = C, (2.64) 

where C is a constant. C is given by its initial value, which is V + 0 = V. The final 
value of C is V f + Vf = 2 Vf (where Vf is the final speed of the block), because the 
block is essentially moving straight down the plane after a very long time. Therefore, 

2 V f = V V f = J- ( 2 ‘ 65 ) 


4. Atwood’s machine 

Let T be the tension in the string, and let a be the acceleration of mi (with upward 
taken to be positive). Then —a is the acceleration of m 2 . So we have 


T — mig = m^a, 

T-m 2 g = m 2 (—a). 

Solving these two equations for a and T gives 

(m 2 - mi)g 2m-i_m 2 g 

a = -, and T = -. 

m 2 + mi m 2 + mi 


( 2 . 66 ) 


(2.67) 


Remarks: As a double-check, a has the correct limits when m2 mi, mi m 2 , and 
m 2 = mi (namely a « g, a « —g, and a = 0, respectively). 
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As far as T goes, if mi = m2 = m, then T = mg, as it should. And if mi -C m 2 , then 
T s# 2mig. This is correct, because it makes the net upward force on mi equal to mig, 
which means that its acceleration is g upward, which is consistent with the fact that m2 is 
essentially in free fall. Jl» 

5. Double Atwood’s machine 

Let the tension in the lower string be T. Then the tension in the upper string is 2 T 
(by balancing the forces on the bottom pulley). The three F = ma equations are 
therefore (with all the a’s taken to be positive upward) 


2 T — mig = miai, 

T — rrizg = m^a^, 

T-m 3 g = m 3 a 3 . (2.68) 


And conservation of string says that the acceleration of mi is 


ai — — 


ci 2 + a 3 \ 

2 ) 


(2.69) 


This follows from the fact that the average position of m 2 and m 3 moves the same 
distance as the bottom pulley, which in turn moves the same distance (but in the 
opposite direction) as mi. 

We now have four equations in the four unknowns, a\, a 2, 0.3, and T. With a little 
work, we can solve for the accelerations, 


4m 2 m 3 - mi (m2 + m3) 
5 4m 2 m 3 + mi (m2 + m 3 ) ’ 
4m2m 3 + mi (m2 - 3m 3 ) 
4m2m 3 + mi (m 2 + m 3 ) ’ 
4m2m 3 + mi(m 3 — 3m2) 

3 4m 2 m 3 + mi(m 2 + m 3 ) 


(2.70) 


Remarks: There are many limits we can check here. A couple are: (1) If m2 = m3 = mi/2, 
then all the a’s are zero, which is correct. (2) If m 3 is much less than both mi and m2, then 
ai = — g, a ,2 = —g, and a 3 = 3 g. To understand this -ig, convince yourself that if mi and 
m2 go down by d, then m 3 goes up by 3d. 

Note that ai can be written as 



In view of the result of Problem 4 in eq. (2.67), we see that as far as mi is concerned, the 
m 2 ,m 3 pulley system acts just like a mass of 4rri2m 3 /(m2 + m 3 ). This has the expected 
properties of equaling zero when either m2 or m 3 is zero, and equaling 2m if m2 = m 3 = m. 

* 

6. Infinite Atwood’s machine 

First Solution: If the strength of gravity on the earth were multiplied by a factor 
r], then the tension in all of the strings in the Atwood’s machine would likewise be 
multiplied by rj. This is true because the only way to produce a quantity with the 
units of tension (that is, force) is to multiply a mass by g. Conversely, if we put 
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the Atwood’s machine on another planet and discover that all of the tensions are 
multiplied by r], then we know that the gravity there must be rjg. 

Let the tension in the string above the first pulley be T. Then the tension in the string 
above the second pulley is T/2 (because the pulley is massless). Let the downward 
acceleration of the second pulley be 02 . Then the second pulley effectively lives in a 
world where gravity has strength g — a 2 . 

Consider the subsystem of all the pulleys except the top one. This infinite subsystem is 
identical to the original infinite system of all the pulleys. Therefore, by the arguments 
in the first paragraph above, we must have 


T _ T/2 
9 9- 02 ’ 


(2.72) 


which gives o 2 = g/2. But a 2 is also the acceleration of the top mass, so our answer 
is g/2. 


Remarks: You can show that the relative acceleration of the second and third pulleys is 
g/4, and that of the third and fourth is g/8 , etc. The acceleration of a mass far down in the 
system therefore equals g( 1/2 + 1/4 + 1/8 + ••■)= g, which makes intuitive sense. 

Note that T = 0 also makes eq. (2.72) true. But this corresponds to putting a mass of zero 
at the end of a finite pulley system (see the following solution). A 


Second Solution: Consider the following auxiliary problem. 


Problem: Two setups are shown below in Fig. 2.29. The first contains a hanging 
mass to. The second contains a pulley, over which two masses, mi and m 2 , hang. Let 
both supports have acceleration a s downward. What should to be, in terms of mi 
and m 2 , so that the tension in the top string is the same in both cases? 

Answer: In the first case, we have 


mg 


— T = ma s . 


(2.73) 


In the second case, let a be the acceleration of m 2 relative to the support (with 
downward taken to be positive). Then we have 

toi(o s — a), 

m 2 (a s + a). (2.74) 

Note that if we define g' = g — a s , then we may write the above three equations as 

mg’ = T, 

, T 

m\g = — — m\a, 

T 

m 2 g = a- + m 2 o. (2.75) 



Eliminating a from the last two of these equations gives T = 4m 1 m 2 g'/ (m-| + m 2 ). 
Using this value of T in the first equation then gives 


4toiTO 2 
TOi + to 2 


(2.76) 



Figure 2.29 
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Note that the value of a s is irrelevant. We effectively have a fixed support in a world 
where the acceleration due to gravity is g' (see eqs. (2.75)), and the answer can’t 
depend on g ', by dimensional analysis. This auxiliary problem shows that the two- 
mass system in the second case may be equivalently treated as a mass m, given by 
eq. (2.76), as far as the upper string is concerned. ■ 


Now let’s look at our infinite Atwood’s machine. Assume that the system has N 
pulleys, where N —> oo. Let the bottom mass be x. Then the auxiliary problem 
shows that the bottom two masses, to and x, may be treated as an effective mass 
/(#), where 


f(x) 


4 mx 
Ax 

1 + ( x/m ) ' 


(2.77) 


We may then treat the combination of the mass f(x) and the next to as an effective 
mass f(f(x)). These iterations may be repeated, until we finally have a mass to and 
a mass f ( ' N ~ 1 \x) hanging over the top pulley. So we must determine the behavior of 
f N (x), as N -4 oc. This behavior is clear if we look at the following plot of f(x). 


y 



Note that x = 3to is a fixed point of f(x). That is, /(3 to) = 3m. This plot shows 
that no matter what x we start with, the iterations approach 3 to (unless we start at 
x = 0, in which case we remain there). These iterations are shown graphically by the 
directed lines in the plot. After reaching the value f(x) on the curve, the line moves 
horizontally to the x value of f(x), and then vertically to the value f(f(x)) on the 
curve, and so on. 

Therefore, since f N (x) —> 3 m as N —> oo, our infinite Atwood’s machine is equivalent 
to (as far as the top mass is concerned) just two masses, to and 3 to. You can then 
quickly show that that the acceleration of the top mass is g/2. 

Note that as far as the support is concerned, the whole apparatus is equivalent to a 
mass 3m. So 3mg is the upward force exerted by the support. 
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7. Line of pulleys 

Let m be the common mass, and let T be the tension in the string. Let a be the 
acceleration of the end masses, and let a' be the acceleration of the other N masses, 
with upward taken to be positive. Note that these N accelerations are indeed all 
equal, because the same net force acts on all of the internal N masses, namely 2 T 
upwards and mg downwards. The F = ma equations for the end and internal masses 
are, respectively, 


T — mg = ma, 

2 T-mg = ma'. (2.78) 

But the string has fixed length. Therefore, 

N(2a') + a + a = 0. (2.79) 

Eliminating T from eqs 
then gives 

a - 


(2.78) gives a = 2o + g. Combining this with eq. (2.79) 


Ng 


(2.80) 


Remarks: For N = 1, we have a = —g/ 3 and a' ’s= g/ 3. For larger N, a increases in 
magnitude and approaches —g/ 2 for N — * oo, and a' decreases in magnitude and approaches 
zero for N —> oo. 

The signs of a and o' in eq. (2.80) may be surprising. You might think that if, say, N = 100, 
then these 100 masses will ‘“win” out over the two end masses, so that the N masses will 
fall. But this is not correct, because there are many (2 N, in fact) tensions acting up on the 
N masses. They do not act like a mass Nrn hanging below one pulley. In fact, two masses 
of m/2 on the ends will balance any number N of masses in the interior (with the help of 
the upward forces from the top row of pulleys). & 

8. Ring of pulleys 

Let T be the tension in the string. Then F = ma for m, gives 

2 T — mig = m^i, (2-81) 


with upward taken to be positive. The aq’s are related by the fact that the string 
has fixed length, which implies that the sum of the displacements of all the masses is 
zero. In other words, 

fli + 02 dr"*" ■ + <ijv = 0. (2.82) 


If we divide eq. (2.81) by m*, and then add the N such equations together, we obtain, 
using eq. (2.82), 


2 T 


(mi 




Ng = 0. 


(2.83) 


Substituting this value for T into (2.81) gives 


o-i = 9 




(2.84) 


A few special cases are: If all the masses are equal, then all at = 0. If rnk = 0 (and all the 
others are not zero), then ak = (N — 1 )g, and all the other a* = —g. 
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9. Exponential force 

We are given x = e~ bt . Integrating this with respect to time gives v(t) = —e~ bt /b+A. 
Integrating again gives x(t ) = e~ bt /b 2 + At + B. The initial condition, v(Q) = 0, gives 
-1/6 + A = 0 => A = 1/6. And the initial condition, a;(0) = 0, gives 1/6 2 + B = 
0 => B = — 1/b 2 . Therefore, 

*<‘> = T? + 5-p- < 285 > 

Limits: For t —> oo, v approaches 1/6, and x approaches t/b— 1/b 2 . We see that the particle 
eventually lags a distance 1/6 2 behind another particle that started at the same position but 
with speed v —1/6. 

10. Falling chain 

Let the density of the chain be p, and let y(t) be the length hanging down through 
the hole at time t. Then the total mass is pi, and the mass hanging below the hole is 
py. The net downward force on the chain is ( py)g , so F = ma gives 

pgy = (pl)y => y= jy. (2.86) 

At this point, there are two ways we can proceed: 


First method: Since we have a function whose second derivative is proportional 
to itself, a good bet for the solution is an exponential function. And indeed, a quick 
check shows that the solution is 


y(t) = Ae at + Be at , where a=y^. (2.87) 

Taking the derivative of this to obtain y(t), and using the given information that 
y( 0) = 0, we find A = B. Using y(0) = y 0 , we then find A = B = yo/2. So the length 
that hangs below the hole is 


2/(i) = y (e at + e at ) = y 0 cosh(ai). (2.88) 


And the speed is 


y(t) = y 13 (e at - e at ) = ayo sinh(at). 


(2.89) 


The time T that satisfies y(T) = I is given by I = yocosh(aT). Using sinhx = 
\/ cosh 2 x — 1, we find that the speed of the chain right when it loses contact with the 
table is 

y(T) = ay 0 sinh(o:T) = aJp - y$= yfglJ 1 - rfi, (2.90) 


where rjo = Vo /I is the initial fraction hanging below the hole. 

If r ]o w 0, then the speed at time T is \fgl (this quickly follows from conservation of 
energy, which is the subject of Chapter 4). Also, you can show that eq. (2.88) implies 
that T goes to infinity logarithmically as % —> 0. 


Second method: Write y as vdv/dy in eq. (2.86), and then separate variables and 
integrate to obtain 


/ vdv = a 2 y 

Jo Jy 0 


= a 2 (y 2 -y 2 ), 


(2.91) 
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where a = y/gjl. Now write v as dy/dt and separate variables again to obtain 



The integral on the left-hand side is cosh 1 (j//j/o), so we arrive at 

y(t) = (/ocosh(at), (2.93) 


in agreement with eq. (2.88). The solution proceeds as above. However, an easier 
way to obtain the final speed with this method is to simply use the result for v in 
eq. (2.91). This tells us that the speed of the chain when it leaves the table (that is, 
when y = £) is v = ay/1 2 — y$, in agreement with eq. (2.90). 

11. Circling around a pole 

Let F be the tension in the string. At the mass, the angle between the string and the 
radius of the dotted circle is 6 = sin -1 (r/i?). In terms of 6, the radial and tangential 
F = ma equations are 


FcosO 
F sin6> 


(2.94) 


Dividing these two equations gives tan 9 = {Rv)/v 2 . Separating variables and inte¬ 
grating gives 


F,% - 


v(t) = 


tan 6 f , 

-Trl dt 

(tanO)t 

R 

/ 1 (tan#)f \ -1 

lu 0 R J 


(2.95) 


Remark: Note that v becomes infinite when 
t = T = — 


In other words, you can keep the mass moving in the desired circle only up to time T. After 
that, it is impossible. (Of course, it will become impossible, for all practical purposes, long 
before v becomes infinite.) The total distance, d = J vdt, is infinite, because this integral 
diverges (barely, like a log) as t approaches T. $ 


12. Throwing a beach ball 

On both the way up and the way down, the total force on the ball is 


F = 


—mg — mav. 


(2.97) 


On the way up, v is positive, so the drag force points downward, as it should. And 
on the way down, v is negative, so the drag force points upward. 

Our strategy for finding Vf will be to produce two different expressions for the maxi¬ 
mum height, h, and then equate them. We’ll find these two expressions by considering 
the upward and then the downward motion of the ball. In doing so, we will need to 
write the acceleration of the ball as a = vdv/dy. 
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For the upward motion, F = ma gives 


—mg — mom 


dv 


vdv 
g + av ' 


(2.98) 


where we have taken advantage of the fact that we know that the speed of the ball 
at the top is zero. Writing v/(g + av) as [1 — g/{g + av)]/a, we may evaluate the 
integral to obtain 


h 


1 + — 


(2.99) 


Now let us consider the downward motion. Let Vf be the final speed, which is a positive 
quantity. The final velocity is then the negative quantity, —Vf. Using F = ma, we 
similarly obtain 

[°dy = - [ **'—-■ (2.100) 

Jh Jo 9 A av 

Performing the integration (or just replacing the Vo in eq. (2.99) with —Vf) gives 

(2.101) 

a a 2 V 9 J 

Equating the expressions for h in eqs. (2.99) and (2.101) gives an implicit equation 
for Vf in terms of vo, 

v 0 + Vf = !J In ( <J + aV ° ^ . (2.102) 

a \g-avfj 


Remarks: In the limit of small a (more precisely, in the limit avo/g 1), we can use 
ln(l + x) = x — x 2 /2 + • • • to obtain approximate values for h in eqs. (2.99) and (2.101). The 
results are, as expected, 

h&—, and h «. (2.103) 

2 9 2 g ’ 

We can also make approximations for large a (or large avo/g). In this limit, the log term 
in eq. (2.99) is negligible, so we obtain h « vo/a. And eq. (2.101) gives Vf « g/a, because 
the argument of the log must be very small in order to give a very large negative number, 
which is needed to produce a positive h on the left-hand side. There is no way to relate Vf 
and h is this limit, because the ball quickly reaches the terminal velocity of —g/a (which is 
the velocity that makes the net force equal to zero), independent of h. 4* 

Let’s now find the times it takes for the ball to go up and to go down. We’ll present 
two methods for doing this. 


First method: Let Ti be the time for the upward path. If we write the acceleration 
of the ball as a = dv/dt, then F = ma gives 


—mg — mav 



dv 

n— 

dt 


-i: 


dv 

g + av 


(2.104) 
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In a similar manner, we find that the time T2 for the downward path is 




a \g — OLVf ) 


Using eq. (2.102), we have 


This is shorter than the time in vacuum (namely 2 vq /g) because Vf < vq. 


(2.105) 


(2.106) 


(2.107) 

(2.108) 


Second method: The very simple form of eq. (2.108) suggests that there is a 
cleaner way to calculate the total time of flight. And indeed, if we integrate mdv/dt = 
—mg — mav with respect to time on the way up, we obtain —Vq = —gT-\ — ah (because 
f vdt = h). Likewise, if we integrate mdv/dt = —mg — mav with respect to time 
on the way down, we obtain —Vf = —gT% + ah (because f vdt = —h). Adding these 
two results gives eq. (2.108). This procedure only works, of course, because the drag 
force is proportional to v. 


Remarks: The fact that the time here is shorter than the time in vacuum isn’t obvious. 
On one hand, the ball doesn’t travel as high in air as it would in vacuum (so you might think 
that Ti + T2 < 2vo/g). But on the other hand, the ball moves slower in air (so you might 
think that Ti +T2 > 2vo/g). It isn’t obvious which effect wins, without doing a calculation. 
For any a, you can use eq. (2.105) to show that Ti < vo/g. But T 2 is harder to get a handle 
on, because it is given in terms of Vf. But in the limit of large a, the ball quickly reaches 
terminal velocity, so we have T 2 « h/vf « (vo/a)/(g/a) = vo/g. Interestingly, this is the 
same as the downward (and upward) time for a ball thrown in vacuum. £ 

13. Balancing a pencil 


(a) The component of gravity in the tangential direction is mg sin 0 « mg9. There¬ 
fore, the tangential F = ma equation is mgO = mid, which may be written as 
0 = (g/£)6. The general solution to this equation is 20 

0(t) = Ae t / T + where t = \flfg. (2.109) 

The constants A and B are found from the initial conditions, 

0(0) =0 O =► A + B = 0 O , 

0(0) = w 0 => {A-B)/t = u 0. (2.110) 

Solving for A and B, and then plugging them into eq. (2.109) gives 

0{t) = ^ (0 O + u 0 t) e t/T + ^ (6> 0 - u 0 t) e~ t/r . (2.111) 

20 If you want, you can derive this by separating variables and integrating. The solution is 
essentially the same as in the second method presented in the solution to Problem 10. 



11-38 


CHAPTER 2. USING F = MA 


(b) The constants A and B will turn out to be small (they will each be of order 
Vh). Therefore, by the time the positive exponential has increased enough to 
make 0 of order 1, the negative exponential will have become negligible. We will 
therefore ignore the latter term from here on. In other words, 

6{t) « i (0 O + w 0 r) e*/ T . (2.112) 

The goal is to keep 6 small for as long as possible. Hence, we want to minimize 
the coefficient of the exponential, subject to the uncertainty-principle constraint, 
(£6o){mIuJo) > h. This constraint gives wo > h/(ml 2 6o)- Therefore, 

<2113) 

Taking the derivative with respect to 6o to minimize the coefficient, we find that 
the minimum value occurs at 



Substituting this back into eq. (2.113) gives 



(2.114) 


(2.115) 


Setting 6 « 1 , and then solving for t gives (using r = y/l/g) 



(2.116) 


With the given values, m = 0.01kg and I = 0.1m, along with g = 10 m/s 2 and 
h = 1.06 • 10 -34 Js, we obtain 

t < i (0.1 s) ln(9 • 10 61 ) w 3.5 s. (2.117) 

No matter how clever you are, and no matter how much money you spend on 
the newest, cutting-edge pencil-balancing equipment, you can never get a pencil 
to balance for more than about four seconds. 


Remarks: This smallness of this answer is quite amazing. It is remarkable that a 
quantum effect on a macroscopic object can produce an everyday value for a time scale. 
Basically, the point here is that the fast exponential growth of 6 (which gives rise to 
the log in the final result for t) wins out over the smallness of h. and produces a result 
for t of order 1. When push comes to shove, exponential effects always win. 

The above value for t depends strongly on l and g, through the y/l/g term. But the 
dependence on m, £, and g in the log term is very weak. If m were increased by a 
factor of 1000, for example, the result for t would increase by only about 10%. Note 
that this implies that any factors of order 1 that we neglected throughout this problem 
are completely irrelevant. They will appear in the argument of the log term, and will 
thus have negligible effect. 

Note that dimensional analysis, which is gen erally a very powerful tool, won’t get you 
too far in this problem. The quantity \J£/g has dimensions of time, and the quantity 
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r) = m 2 £ 3 g/h 2 is dimensionless (it is the only such quantity), so the balancing time 
must take the form, 

i * yj ( g M-. (2-118) 

where / is some function. If the leading term in / were a power (even, for example, a 
square root), then t would essentially be infinite (t ~ 10 30 s for the square root). But / 
in fact turns out to be a log (which you can’t determine without solving the problem), 
which completely cancels out the smallness of %, reducing an essentially infinite time 
down to a few seconds. & 

14. Throwing a ball from a cliff 

Let the inclination angle be 9. Then the horizontal speed is v x = v cos 9, and the 
initial vertical speed is v y = v sin 9. The time it takes for the ball to hit the ground 
is given by h + (v sin 6)t — gt 2 /2 = 0. Therefore, 

t = - ^sin9 + \J sin 2 0 + ,6 ^ , where (3 = . (2.119) 

(The ” solution for t from the quadratic formula corresponds to the ball being 
thrown backwards down through the cliff.) The horizontal distance traveled is d = 
( v cos 9)t, which gives 


- cos 9 sin 6 + \ sin 2 9 + /3 


We want to maximize this function of 9. Taking the derivative, multiplying through 
by \J sin 2 9 + (3, and setting the result equal to zero, gives 

(cos 2 9 - sin 2 9)\Jsin 2 9 +(3 = sin 9(/3 - (cos 2 9 - sin 2 9)). (2.121) 

Using cos 2 9=1 — sin 2 9, and then squaring and simplifying this equation, gives an 
optimal angle of 

sin6» max = . 1 = = . 1 . (2.122) 

V2 + (3 i/2 + 2 gh/v 2 

Plugging this into eq. (2.120), and simplifying, gives a maximum distance of 


Remarks: If h = 0, then we obtain 0 ma x = 7r/4 and d max = v 2 /g, in agreement with the 
example in Section 2.4. If h —> oo or v —> 0, then 9 « 0, which makes sense. 

If we make use of conservation of energy (discussed in Chapter 4), it turns out that the final 
speed of the ball when it hits the ground is Vf = sjv' 2 + 2gh. The maximum distance in eq. 
(2.123) may therefore be written as (with Vi=v being the initial speed) 

d max (2.124) 

Note that this is symmetric in Vi and Vf , as it must be, because we could imagine the 
trajectory running backwards. Also, it equals zero if Vi is zero, as it should. We can also 
write the angle 9 in eq. (2.122) in terms of Vf (instead of h). You can show that the result 
is tan# = Vi/vf. You can further show that this implies that the initial and final velocities 
are perpendicular to each other. The simplicity of all these results suggests that there is an 
easier way to derive them, but I have no clue what it is. & 
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15. Redirected motion 

First Solution: We will use the results of Problem 14, namely eqs. (2.123) and 
(2.122), which say that an object projected from height y at speed v travels a maxi¬ 
mum horizontal distance of 



and the optimal angle yielding this distance is 


sin# = 


y/2 + 2gy/v 2 ' 


(2.125) 


(2.126) 


In the problem at hand, the object is dropped from a height h, so conservation of 
energy (or integration of mvdv/dy = —mg) says that the speed at height y is 

v = y/2 g(h-y). (2.127) 

Plugging this into eq. (2.125) shows that the maximum horizontal distance, as a 
function of y, is 

d m M = 2 y/h(h-y). (2.128) 

This is maximum when y = 0, in which case the distance is d mlrx = 2 h. Eq. (2.126) 
then gives the associated optimal angle as 6 = 45°. 


Second Solution: Assume that the greatest distance, do, is obtained when y = yo 
and 9 = 9o■ And let the speed at yo be vq■ We will show that yo must be 0. We will 
do this by assuming that yo ^ 0 and explicitly constructing a situation that yields a 
greater distance. 

Consider the situation where the ball falls all the way down to y = 0 and then bounces 
up at an angle such that when it reaches the height yo, it is traveling at an angle 9o 
with respect to the horizontal. When it reaches the height yo, the ball will have speed 
Vo (by conservation of energy), so it will travel a horizontal distance do from this point. 
But the ball already traveled a nonzero horizontal distance on its way up to the height 
yo- We have therefore constructed a situation that yields a distance greater than d 0 . 
Hence, the optimal setup must have yo = 0. Therefore, the maximum distance is 
obtained when y = 0, in which case the example in Section 2.4 says that the optimal 
angle is 9 = 45°. 

If we want the ball to go even further, we can simply dig a (wide enough) hole in the 
ground and have the ball bounce from the bottom of the hole. 

16. Maximum trajectory length 

Let 9 be the angle at which the ball is thrown. Then the coordinates are given by 
x = (v cos 9)t and y = (v sin 6)t — gt 2 /2. The ball reaches its maximum height at 
t = usin 9/g, so the length of the trajectory is 


L 




y/(vcos9) 2 + (usin# — gt ) 2 dt 


2v cos 9 



tan 9 — 


9t V 
v cos 9 J 


dt. 


(2.129) 
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Letting 2 = tan 9 — gt/v cos 6, we obtain 
r 2v 2 cos 2 9 


L 


(2.130) 


We can either look up this integral, or we can derive it by making a z = sinh a 
substitution. The result is 


L 



(2.131) 


As a double-check, you can verify that L = 0 when 9 = 0, and L = v 2 /g when 9 = 90°. 
Taking the derivative of eq. (2.131) to find the maximum, we obtain 


0 = cos 9 — 2 cos 9 sin 

This reduces to 


/1 + sin 9 
V cos 9 


f cos 2 9 


\ 1 + sin 
/1 + sin 9 \ 


cos 9 ^ cos 2 9 + (1 + sin 9) sin 9 
cos 2 9 


(2.132) 

1 = sinIn ^ ' qqsQ “ J • (2.133) 

Finally, you can show numerically that the solution for 9 is 9q « 56.5°. 


Remark: A few possible trajectories are shown Fig. 2.30. Since it is well known that 
9 = 45° provides the maximum horizontal distance, it follows from the figure that the Qo 
yielding the arc of maximum length must satisfy 9o > 45°. The exact angle, however, requires 
the above detailed calculation. 4 



Figure 2.30 


17. Maximum trajectory area 

Let 9 be the angle at which the ball is thrown. Then the coordinates are given by 
x = (ucos 9)t and y = (usin#)f — gt 2 /2. The total time in the air is 2(rsin 9)/g, so 
the area under the trajectory is 


A 


Taking the derivative, 
when 



” sin 3 9 cos 9. 
3g 2 


v cos 9 dt 


(2.134) 


find that the maximum occurs when tan 9 = \/3, that is, 


9 = 60°. 


(2.135) 


The maximum area is then A max = \/3v 4 /8g 2 . Note that by dimensional analysis, we 
know that the area, which has dimensions of distance squared, must be proportional 
to v 4 /g 2 . 
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Figure 2.31 

Av 



18. Bouncing ball 

The ball travels 2 h during the first up-and-down journey. It travels 2 hf during the 
second, then 2 hf 2 during the third, and so on. Therefore, the total distance traveled 
is 


D 


2h(l + / + / 2 + f + ■ • •) 
2 h 

~r 


(2.136) 


The time it takes to fall down during the first up-and-down is obtai ned from h = gt 2 / 2. 
Therefore, the time for the first up-and-down equals 2t = 2\j2hfg. Likewise, the time 
for the second up-and-down equals 2y/2(hf)/g. Each successive up-and-down time 
decreases by a factor of \ff, so the total time is 


T 


2 

2 


l2h 1 

yy i-v7‘ 


The average speed equals 


T itW 


(2.137) 


(2.138) 


Remark: The average speed for / « 1 is roughly half of the average speed for / « 0. This 
may seem somewhat counterintuitive, because in the / « 0 case the ball slows down far more 
quickly than in the / « 1 case. But the / « 0 case consists of essentially only one bounce, 
and the average speed for that one bounce is the largest of any bounce. Both D and T are 
smaller for / * 0 than for /« 1. but T is smaller by a larger factor. £ 


19. Centripetal acceleration 


The position and velocity vectors at two nearby times are shown in Fig. 2.31. Their 
differences, Ar = r2 — ri and Av = V2 — Vi, are shown in Fig. 2.32. The angle 
between the v’s is the same as the angle between the r’s, because each v makes a 
right angle with the corresponding r. The triangles in Fig. 2.32 are therefore similar, 
so we have 


|Av| _ |Ar| 


(2.139) 


where r = |r| and v = |v|. Dividing eq. (2.139) through by At gives 


1 Av 

1 Ar 


N = M 


v 2 

a = — 

v At 

~ r ~At 


v r 




We have assumed that At is infinitesimal here, which allows us to get 
in favor of instantaneous quantities. 


(2.140) 
rid of the A’s 


20. Free particle 

For zero force, eqs. (2.52) give 


r = r6 2 , 

r'6 = -2 rd. (2.141) 


Figure 2.32 
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Separating variables in the second equation and integrating yields 


In 0 = —2 lnr + C 


9=° 2 , (2.142) 


where D = e c is a constant of integration, determined by the initial conditions. 21 
Substituting this value of 6 into the first of eqs. (2.141), and then multiplying both 
sides by r and integrating, gives 


(?) 




2 


D 2 
2 r 2 


(2.143) 


We want r = 0 when r = ro, which implies that E = D 2 /2r ( 2 . Therefore, 



(2.144) 


where V = D/tq. Separating variables and integrating gives 


=* y/r*-r$ = Vt =* r = ^r 2 + (Ff) 2 , 

(2.145) 

where the constant of integration is zero, because we have chosen t = 0 to correspond 
with r = r 0 . Plugging this value for r into the 0 = D/r' 2 = Vr 0 /r 2 result in eq. 
(2.142) gives 


Finally, combining this with the result for r in eq. 
desired. 


=>■ cos 9 = — . = . 

V r o + ( vt ) 2 

(2.146) 

(2.145) gives cos 6 = ro/r, as 


21. A force Fg = r6 

With the given force, eqs. (2.52) become 

0 = m(r — rO 2 ), 

mf9 = m(r0 + 2f9). (2.147) 

The second of these equations gives — r6 = rO. Therefore, 

ln<9 = —In r + C => 9=—, (2.148) 

where D = e c is a constant of integration, determined by the initial conditions. 
Substituting this value of 6 into the first of eqs. (2.147), and then multiplying both 
sides by r and integrating, gives 





2 


= D 2 lnr + E. 


(2.149) 


Therefore, 

f = VAlnr + B, (2.150) 

where A = 2D 2 and B = 2 E. 


21 The statement that r 2 0 is constant is simply the statement of conservation of angular momen¬ 
tum, because r 2 d = r(r0) = rve- More on this in Chapters 6 and 7. 
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Chapter 3 


Oscillations 

Copyright 2004 by David Morin, morin@physics.harvard.edu 

In this chapter we will discuss oscillatory motion. The simplest examples of such 
motion are a swinging pendulum and a mass on a spring, but it is possible to make a 
system more complicated by introducing a damping force and/or an external driving 
force. We will study all of these cases. 

We are interested in oscillatory motion for two reasons. First, we study it because 
we can study it. This is one of the few systems in physics where we can solve the 
motion exactly. There’s nothing wrong with looking under the lamppost every now 
and then. Second, oscillatory motion is ubiquitous in nature, for reasons that will 
become clear in Section 4.2. If there was ever a type of physical system worthy of 
study, this is it. 

We’ll jump right into some math in Section 3.1. And then in Section 3.2 we’ll 
show how the math is applied to the physics. 

3.1 Linear differential equations 

A linear differential equation is one in which x and its time derivatives enter only 
through their first powers. An example is 3x+7x+x = 0. An example of a nonlinear 
differential equation is 3x + 7x 2 + x = 0. 

If the right-hand side of the equation is zero, then we use the term homogeneous 
differential equation. If the right-hand side is some function of t, as in the case of 
3x — 4x = 9t 2 — 5, then we use the term inhomogeneous differential equation. The 
goal of this chapter is to learn how to solve these two types of equations. Linear 
differential equations come up again and again in physics, so we had better find a 
systematic method of solving them. 

The techniques that we will use are best learned through examples, so let’s 
solve a few differential equations, starting with some simple ones. Throughout this 
chapter, x will be understood to be a function of t. Hence, a dot will denote time 
differentiation. 


Example 1 (x = ax): This is a very simple differential equation. There are two 
ways (at least) to solve it. 
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First method: Separate variables to obtain dx/x = adt, and then integrate to 
obtain In x = at+ c. Exponentiate to obtain 


x = Ae at 


(3.1) 


where A = e c is a constant factor. A is determined by the value of x at, say, t = 0. 

Second method: Guess an exponential solution, that is, one of the form x = Ae at . 
Substitution into x = ax immediately gives a = a. Therefore, the solution is x = 
Ae at . Note that we can’t solve for A, due to the fact that our differential equation is 
homogeneous and linear in x (translation: A cancels out). A is determined from the 
initial condition. 

This method may seem a bit silly, and somewhat cheap. But as we will see below, 
guessing these exponential functions (or sums of them) is actually the most general 
thing we can try, so the method is indeed quite general. 

Remark: Using this method, you may be concerned that although we have found one 
solution, we might have missed another one. But the general theory of differential equations 
says that a first-order linear equation has only one independent solution (we’ll just accept 
this fact here). So if we find one solution, then we know that we’ve found the whole thing. 

* 


Example 2 (x = ax): If a is negative, then this equation describes the oscillatory 
motion of, say, a spring. If a is positive, then it describes exponentially growing or 
decaying motion. There are two ways (at least) to solve this equation. 

First method: We can use the separation-of-variables method of Section 2.3 here, 
because our system is one in which the force depends on only the position x. But this 
method is rather cumbersome, as you found if you did Exercise 2.10 or 2.11. It will 
certainly work, but in the case where our equation is a linear function of x, there is 
a much simpler method: 

Second method: As in the first example above, we can guess a solution of the form 
x(t) = Ae at and then find out what a must be. Again, we can’t solve for A, because 
it cancels out. Plugging Ae at into x = ax gives a = ±.y/a. We have therefore found 
two solutions. The most general solution is an arbitrary linear combination of these, 

x(t) = Ae^ 1 + Be~^\ (3.2) 

as you can quickly check. A and B are determined from the initial conditions. 

Very Important Remark: The fact that the sum of two different solutions is again a so¬ 
lution to our equation is a monumentally important property of linear differential equations. 
This property does not hold for nonlinear differential equations, for example x 2 = x, because 
the act of squaring after adding the two solutions produces a cross term which destroys the 
equality, as you should check. 

This property is called the principle of superposition. That is, superimposing two solutions 
yields another solution. This quality makes theories in physics that are governed by linear 
equations much easier to deal with than those that are governed by nonlinear ones. General 
Relativity, for example, is permeated with nonlinear equations, and solutions to most General 
Relativity systems are extremely difficult to come by. 
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For equations with one main condition 
(Those linear), you have permission 
To take your solutions, 

With firm resolutions, 

And add them in superposition. & 

Let’s say a little more about the solution in eq. (3.2). If a is negative, then let’s define 
a = —u> 2 , where ui is a real number. The solution now becomes x{t) = Ae lut + Be~ lu>t . 
Using e ie = cos 0 + i sin 9, this can be written in terms of trig functions, if desired. 
Various ways of writing the solution are: 


x(t) 

— Aai^ -(- Be 


x(t) 

= Ceos ujt + D sin ut, 


x(t) 

= Ecos(u>t + 0i), 


x(t) 

= F sin(ut + fa) ■ 

(3.3) 


The various constants here are related to each other. For example, C = E cos ci)-\ 
and D = —E sin <p A , which follow from the cosine sum formula. Note that there 
are two free parameters in each of the above expressions for x(t). These parameters 
are determined from the initial conditions (say, the position and speed at t • 0). 
Depending on the specifics of a given problem, one of the above forms will work 
better than the others. 

If a is positive, then let’s define a = u> 2 , where u> is a real number. The solution in 
eq. (3.2) now becomes x(t) = Ae ut + Be~ wt . Using e e = cosh 6 + sinli 6, this can be 
written in terms of hyperbolic trig functions, if desired. Various ways of writing the 
solution are: 

x(t) = Ae ut + Be~ ut 
x(t) = CcoshoA + -Dsinhwt, 
x(t) = Ecosh(u>t + 

x(t) = /•’sinh(u;t + tfe). (3.4) 

Again, the various constants are related to each other. If you are unfamiliar with the 
hyperbolic trig functions, a few facts are listed in Appendix A. 

Remarks: Although the solution in eq. (3.2) is completely correct for both signs of a, it is 
generally more illuminating to write the negative-a solutions in either the trig forms or the 
e ±l “ t exponential form where the i’s are explicit. 

As in the first example above, you may be concerned that although we have found two 
solutions to the equation, we might have missed others. But the general theory of differential 
equations says that our second-order linear equation has only two independent solutions. 
Therefore, having found two independent solutions, we know that we’ve found them all. 

The usefulness of this method of guessing exponential solutions cannot be overempha¬ 
sized. It may seem somewhat restrictive, but it works. The examples in the remainder 
of this chapter should convince you of this. 

This is our method, essential, 

For equations we solve, differential. 

It gets the job done, 
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And it’s even quite fun. 

We just try a routine exponential. 


Example 3 (x + 2-yx + ax = 0): This will be our last mathematical example, and 
then we’ll start doing some physics. As we will see later, this example pertains to a 
damped harmonic oscillator. We have put a factor of 2 in the coefficient of x here to 
make some later formulas look nicer. 

Note that the force in this example (if we allow ourselves to think physically for a 
moment) is —2jx — ax (times m), which depends on both v and x. Our methods of 
Section 2.3 therefore don’t apply. This leaves us with only our method of guessing an 
exponential solution, Ae at . Plugging this into the given equation, and cancelling the 
nonzero factor of Ae at , gives 

a 2 + 27a + a = 0. (3.5) 

The solutions for a are _ 

-7 ± vV - a. (3.6) 

Call these a\ and a2- Then the general solution to our equation is 
x(t) = Ae ait + Be a2t 

= e (/hV v ' ! " + Be 's/P- a y. ( 3 . 7 ) 

Well, well, our method of trying Ae at doesn’t look so trivial anymore... 

If 7 2 — a < 0, then we can write our answer in terms of sines and cosines, so we have 
oscillatory motion that decreases in time due to the e -7t factor (or it increases, if 
7 < 0, but this is rarely physical). If j 2 — a > 0, then we have exponential motion. 
We’ll talk more about these different possibilities in Section 3.3. 


In general, if we have an n-th order homogeneous linear differential equation, 


d n x d n l x dx 

d^ + c "- 1 ^r + '" + Cl * 


CoX 


0, 


(3.8) 


then our strategy is to guess an exponential solution, x(t) = Ae at , and to then (in 

theory) solve the resulting nth order equation (namely a n + c n -\a n ~ x H-1- c\a + 

co = 0) for a, to obtain the solutions a 1 ...., a n . The general solution for x(t) is 
then 

x(t) = A ie ait + A 2 e a2t + • • • + A n e ant , (3.9) 


where the Ai are determined from the initial conditions. In practice, however, we 
will rarely encounter differential equations of degree higher than 2. Note: if some of 
the ai happen to be equal, then eq. (3.9) is not valid, so a modification is needed. 
We will encounter such a situation in Section 3.3. 


3.2 Simple harmonic motion 

Let’s now do some real live physical problems. We’ll start with simple harmonic 
motion. This is the motion undergone by a particle subject to a force F(x) = —kx. 
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The classic system that undergoes simple harmonic motion is a mass attached 
to a spring (see Fig. 3.1). A typical spring has a force of the form F(x) = —kx, 
where x is the displacement from equilibrium. This is “Hooke’s law,” and it holds 
as long as the spring isn’t stretched too far; eventually this expression breaks down 
for any real spring. Assuming a —kx force, F = ma gives —kx = mx, or 

x + w 2 x = 0, where u = \ —. (3.10) 

V m 

This is simply the equation we studied in Example 2 in the previous section. From 
eq. (3.3), the solution to may be written as 

x(t) = Acos(u>t + (f>), (3-11) 

where A and <f> are determined from the initial conditions. This trig solution shows 
that the system will oscillate back and forth forever in time. 

Remark: The constants A and <j> are determined from the initial conditions. If, for 
example, a;(0) = 0 and £(0) = v, then we must have Acos<f> = 0 and — Ausmcj) = v. Hence, 
(j) = 7r/2, and A = —v/uj. Therefore, the solution is x(t) = — (v/uj) cos(ujt + n/2). This 
looks a little nicer if we write it as x(t) = (v/uj) sin(x't). So, given these initial conditions, 
we could have arrived at this result a little quicker if we had chosen the “sin” solution in 
eq. (3.3). * 


Figure 3.1 


Example (Simple pendulum): Another classic system that undergoes (approxi¬ 
mately) simple harmonic motion is the simple pendulum, that is, a mass that hangs 
on a massless string and swings in a vertical plane. 

Let I be the length of the string, and let 0 be the angle the string makes with the 
vertical (see Fig. 3.2). Then the gravitational force on the mass in the tangential 
direction is —mg sin 6*. So F = ma in the tangential direction gives 

—mgsinO = m(l8) (3-12) 

The tension in the string exactly cancels the radial component of gravity, so the radial 
F = ma serves only to tell us the tension, which we won’t need here. 

We will now enter the realm of approximations and assume that the amplitude of the 
oscillations is small. Without this approximation, the problem cannot be solved in 
closed form. Assuming 6 is small, we can use sin# « 9 in eq. (3.12) to obtain 




Figure 3.2 
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F f 

Figure 3.3 


e -y‘ 



Figure 3.4 


3.3 Damped harmonic motion 

Consider a mass m attached to the end of a spring with spring constant k. Let the 
mass be subject to a drag force proportional to its velocity, Ff = —bv; see Fig. 3.3. 1 
What is the position as a function of time? 2 

The force on the mass is F = — bx — kx. So F = mx gives 

x + 2yx + uj 2 x = 0, (3.15) 

where 2y = b/m , and co = y/k/m. But this is exactly the equation we solved in 
Example 3 in Section 3.1 (with a —* oj 2 ). Now, however, we have the physical 
restrictions that 7 > 0 and co 2 > 0. Letting fi 2 = j 2 — oj 2 for simplicity, we may 
write the solution in eq. (3.7) as 

x(t) = e~ 7t (Ae nt + Be~ nt ) , where 0 2 = 7 2 -co 2 . (3.16) 

There are three cases to consider. 

Case 1: Underdamping (fl 2 < 0) 

In this case, to > 7. Since LI is imaginary, let us define the real number Co = 
y/oo 2 — 7 2 , so that LI = iCo. Eq. (3.16) then gives 

x(t) = e _7t ( Ae^ + Be~ lu>t ^j 

= e-^Ccosiut + 4)). (3.17) 

These two forms are equivalent. Depending on the given problem, one of these 
expressions will inevitably work better than the other. Or perhaps one of the other 
forms in eq. (3.3) (times e _7t ) will be the most useful one. 

Using e l6 = cos 9 + isin0, the constants in eq. (3.17) are related by A + B = 
C cos 4> and A — B = iC sin 6. In a physical problem, x(t) is real, so we must have 
A* = B. where the star denotes complex conjugation. The two constants A and B, 
or C and (/>, are determined from the initial conditions. 

The cosine form makes it apparent that the motion is harmonic motion whose 
amplitude decreases in time, due to the e -7 * factor. A plot of such motion is shown 
in Fig. 3.4. Note that the frequency of the motion, Co = yfup —7 2 , is less than the 
natural frequency, 00 , of the undamped oscillator. 

Remarks: If 7 is very small, then Co w to, which makes sense because we almost have 
an undamped oscillator. If 7 is very close to u>, then Cu « 0. So the oscillations are very 

1 The subscript / stands for “friction” here. We’ll have to save the letter d for “driving” in the 
next section. 

2 We choose to study this Ff = —bv damping force because (1) it is linear in x, which will allow 
us to solve for the motion, and (2) it is a perfectly realistic force; an object moving at a slow speed 
through a fluid will generally experience a drag force proportional to its velocity. Note that this 
Ff = —bv force is not the force that a mass would feel if it were placed on a table with friction. In 
that case the drag force would be (roughly) constant. 






3.3. DAMPED HARMONIC MOTION 


III-7 


slow. Of course, for very small u> it is hard to even tell that the oscillations exist, because 
they will damp out on a time scale of order 1/7, which will be short compared to the long 
time scale of the oscillations, l/d>. X 


Case 2: Overdamping (O 2 > 0) 

In this case, uj < 7. 0 is real (and taken to be positive), so eq. (3.16) gives 

x{t) = Ae-b-** + Be-^+W. (3.18) 

There is no oscillatory motion in this case; see Fig. 3.5. Note that 7 > fi = 
y/j 2 — uj 2 , so both of the exponents are negative. The motion therefore goes to 
zero for large t. This had better be the case, because a real spring is certainly not 
going to have the motion head off to infinity. If we had obtained a positive exponent 
somehow, we’d know we had made a mistake. 

Remarks: If 7 is just slightly larger than u>, then Q fn 0, so the two terms in (3.18) are 
roughly equal, and we essentially have exponential decay, according to e _7 h If 7 » 10 (that 
is, strong damping), then « 7, so the first term in (3.18) dominates, and we essentially 
have exponential decay according to e _ N- 0 ) 4 . We can be somewhat quantitative about 
this by approximating Q. as Cl = y^y 2 — w 2 = — u> 2 /7 2 w 7(1 — w 2 /27 s ). Hence, 

the exponential behavior goes like e~ u t ^ 1 . This is slow decay (that it, slow compared to 
t ~ 1/w), which makes sense if the damping is very strong. The mass slowly creeps back to 
the origin, as in the case of a weak spring immersed in molasses. X 



Figure 3.5 


Case 3: Critical damping (fl 2 = 0) 

In this case, 7 = 0;. Eq. (3.15) therefore becomes x + 2jx + ~/ 2 x = 0. In this special 
case, we have to be careful in solving our differential equation. The solution in eq. 
(3.16) is not valid, because in the procedure leading to eq. (3.7), the roots aq and 
a2 are equal (to —7), so we have really found only one solution, e~ jt . We’ll just 
invoke here the result from the theory of differential equations that says that in this 
special case, the other solution is of the form te _7 *. 

Remark: You should check explicitly that te~ Jt solves the equation x + 2 jx + j' 2 x = 0 . 
Or if you want to, you can derive it in the spirit of Problem 1. In the more general case 
where there are n identical roots in the procedure leading to eq. (3.9) (call them all a), 
the n independent solutions to the differential equation are t k e at , for 0 < k < (n — 1). But 
more often than not, there are no repeated roots, so you don’t have to worry about this. X 

Our solution is therefore of the form 

x(t) = e-^iA + Bt). (3.19) 

The exponential factor eventually wins out over the Bt term, so the motion goes to 
zero for large t (see Fig. 3.6). 

If we are given a spring with a fixed u>, and if we look at the system for different 
values of 7, then critical damping (when 7 = u) is the case where the motion 



Figure 3.6 
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converges to zero in the quickest way (which is like e~ ujt ). This is true because in 
the underdamped case (7 < u), the envelope of the oscillatory motion goes like e~ lf , 
which goes to zero slower than e~ wt , because 7 < u>. And in the overdamped case 
(7 > u), the dominant piece is the term. And as you can verify, if 7 > u> 

then 7 — ^ = 7 — — j§P-< w, so this motion also goes to zero slower than e~ ut . 

Critical damping is very important in many real systems, such as screen doors 
and shock absorbers, where the goal is to have the system head to zero (without 
overshooting and bouncing around) as fast as possible. 


3.4 Driven (and damped) harmonic motion 


Before we examine driven harmonic motion, we must learn how to solve a new type 
of differential equation. How can we solve something of the form 

x + 27a; + ax = Coe^ 0 *, (3.20) 


where 7, a, ujq, and C'q are given quantities? This is an inhomogeneous differential 
equation, due to the term on the right-hand side. It’s not very physical, because the 
right-hand side is complex, but let’s not worry about this for now. Equations of this 
sort will come up again and again, and fortunately there is a nice and easy (although 
sometimes messy) method for solving them. As usual, the method involves making 
a reasonable guess, plugging it in, and seeing what condition comes out. 

Since we have the e lWot sitting on the right-hand side of eq. (3.20), let’s guess 
a solution of the form x(t) = Ae lu>ot . A will depend on ujo, among other things, as 
we will see. Plugging this guess into eq. (3.20) and cancelling the non-zero factor 
of e llJJot , we obtain 

(— u>q^A T 2j(iujo)A -\- clA = Co- (3.21) 

Solving for A, we find that our solution for x is 


x(t ) = 


C'q 


I- 2i^fuj 0 


(3.22) 


Note the differences between this technique and the one in Example 3 in Section 
3.1. In that example, the goal was to determine the a in x(t ) = Ae at . And there 
was no way to solve for A; the initial conditions determined A. But in the present 
technique, the ujo in x(t) = Ae VjJot is a given quantity, and the goal is to solve for 
A in terms of the given constants. Therefore, in the solution in eq. (3.22), there 
are no free constants to be determined from the initial conditions. We’ve found one 
particular solution, and we’re stuck with it. The term particular solution is what 
people use for eq. (3.22). 

With no freedom to adjust the solution in eq. (3.22), how can we satisfy an 
arbitrary set of initial conditions? Fortunately, eq. (3.22) does not represent the 
most general solution to eq. (3.20). The most general solution is the sum of our 
particular solution in eq. (3.22), plus the “homogeneous” solution we found in 
eq. (3.7). This sum is certainly a solution, because the solution in eq. (3.7) was 
explicitly constructed to yield zero when plugged into the left-hand side of eq. (3.20). 
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Therefore, tacking it onto our particular solution doesn’t change the equality in eq. 
(3.20), because the left side is linear. The principle of superposition has saved the 
day. The complete solution to eq. (3.20) is therefore 

*( t ) - + Be -V^=) + ( _^ + ^ o + J (3.23) 

where A and B are determined from the initial conditions. 

With superposition in mind, it is clear what the strategy should be if we have a 
slightly more general equation to solve, for example, 

x + 2- f x + ax = C 1 e iu,lt + C 2 e^ 2t . (3.24) 

Simply solve the equation with only the first term on the right. Then solve the 
equation with only the second term on the right. Then add the two solutions. And 
then add on the homogeneous solution from eq. (3.7). We are able to apply the 
principle of superposition because the left-hand side of eq. (3.24) is linear. 

Finally, let’s look at the case where we have many such terms on the right-hand 
side, for example, 

N 

x + 2jx + ax = y~) C n e luJnt . (3.25) 

We simply have to solve N different equations, each with only one of the N terms 
on the right-hand side. Then we add up all the solutions, and then we add on the 
homogeneous solution from eq. (3.7). If N is infinite, that’s fine; we’ll just have to 
add up an infinite number of solutions. This is the principle of superposition at its 
best. 

Remark: The previous paragraph, combined with a basic result from Fourier analysis, 
allows us to solve (in principle) any equation of the form 

x + 2yx + ax = f(t). (3.26) 

Fourier analysis says that any (nice enough) function /(t) may be decomposed into its 
Fourier components, 

/(f) = f g(w)e iwt dw, (3.27) 

In this continuous sum, the functions g(ui) take the place of the coefficients C n in eq. (3.25). 
So, if S u (t) is the solution for x(t) when there is only the term e iut on the right-hand side 
of eq. (3.26) (that is, S u (t) is the solution given in eq. (3.22), without the Co factor), then 
the principle of superposition tells us that the complete particular solution to (3.26) is 

x(t)=J g(u)S u (t)du>. (3.28) 

Finding the coefficients g(uj) is the hard part (or, rather, the messy part), but we won’t 
get into that here. We won’t do anything with Fourier analysis in this book, but we just 
wanted to let you know that it is possible to solve (3.26) for any function /(t). Most 
of the functions we’ll consider will be nice functions like coswot, for which the Fourier 
decomposition is simply the finite sum, cos u> 0 t = |(e* Wot + e~ Wot ). X 
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Let’s now do a physical example. 


Jh^LgJLg-|~^~| 


m 


Figure 3.7 


Example (Damped and driven spring) : Consider a spring with spring constant 
k. A mass to at the end of the spring is subject to a drag force proportional to its 
velocity, Ff = —bv. The mass is also subject to a driving force, F d (t) = F d cos ui d t 
(see Fig. 3.7). What is its position as a function of time? 


Solution: The force on the mass is F(x,x,t ) = —bx — kx + Fhcosa^f. So F = ma 
gives 

x + 2ya; + u> 2 x = F cos u>dt 

= f (e iUdt . (3.29) 

where 27 = b/m, lo = \fkfm , and F = F d /m. Note that there are two different 
frequencies here, u and uj d , which need not have anything to do with each other. Eq. 
(3.22), along with the principle of superposition, tells us that our particular solution 
is 


F/2 

\- 2i-yu) d -+ 


F/2 

\ - 2 iq-t 


(3.30) 


The complete solution is the sum of this particular solution and the homogeneous 
solution from eq. (3.16). 

Let’s now eliminate the i’s in eq. (3.30) (which we had better be able to do, because 
x must be real), and write x in terms of sines and cosines. Getting the i’s out of the 
denominators, and using e 10 = cos 0 + i sin 6, we find (after a little work) 


x p (t) = 


( FjuP-ufi \ 


COS U)dt + 


2Fju>d 


sinwdh 


(3.31) 


Remarks: If you want, you can solve eq. (3.29) simply by taking the real part of the 
solution to eq. (3.20), that is, the x(t) in eq. (3.22). This is true because if we take the real 
part of eq. (3.20), we obtain 

^(Re(a:))+27|(Re(a:))+a(Re( a; )) = Re (Coe^*) 

= Co cos(a;of)- (3.32) 

In other words, if x satisfies eq. (3.20) with a Coe lulot on the right-hand side, then Re(x) 
satisfies it with a Co cos(o>o t) on the right. 

At any rate, it is clear that (with Co = F) the real part of the solution in eq. (3.22) does 
indeed give the result in eq. (3.31), because in eq. (3.30) we simply took half of a quantity 
plus its complex conjugate, which is the real part. 

If you don’t like using complex numbers, another way of solving eq. (3.29) is to keep it in 
the form with the cos uidt on the right, and simply guess a solution of the form A cos u>dt *f* 
B sin uj d L. and then solve for A and B (this is the task of Problem 5). The result will be eq. 
(3.31). A 

We can now write eq. (3.31) in a very simple form. If we define 




(3.33) 
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then we can rewrite eq. (3.31) as 


x p (t) = 


where is defined by 
w 2 — up 

COS 0 = „ d , 


F ({ufi-uft 
R\ R 
F 

— COS (cop - (j>) 


27 UJ d \ 

cop H-— sin cop 1 


sin^ = 


M' 


27 Ud 


(3.34) 


(3.35) 


The triangle describing the angle (jo is shown in Fig. 3.8. Note that 0 < 0 < tt, 
because sin 0 is positive. 

Recalling the homogeneous solution in eq. (3.16), we can write the complete solution 
to eq. (3.29) as 


x(t) = ^ cos (cop - 4>) + e 7t (Ae nt + Be nt ) . (3.36) 

The constants A and B are determined from the initial conditions. Note that if there 
is any damping at all in the system (that is, if 7 > 0), then the homogeneous part of 
the solution goes to zero for large t, and we are left with only the particular solution. 
In other words, the system approaches a definite x(t), namely x p (t), independent of 
the initial conditions. 



oT-oiJ 

Figure 3.8 


Resonance 

The amplitude of the motion given in eq. (3.34) is proportional to 


R pTNfp+vPp' 

Given u>d and 7, this is maximum when uj = uod- Given co and 7, it is maximum 
when u>d = vV 2 — as you can show in Exercise 15. But for weak damping (that 
is, 7 «w, which is usually the case we are concerned with), this reduces to a~ a; 
also. 

The term resonance is used to describe the situation where the amplitude of the 
oscillations is as large as possible. It is quite reasonable that this is achieved when 
the driving frequency equals the frequency of the spring. But what is the value of 
the phase </> at resonance? Using eq. (3.35), we see that 0 satisfies tan r/» « 00 when 
uod ~ 00 . Therefore, 0 = tt/2 (it is indeed ir/2, and not — 7r/2, because the sin 0 in eq. 
(3.35) is positive), and the motion of the particle lags the driving force by a quarter 
of a cycle at resonance. For example, when the particle moves rightward past the 
origin (which means it has a quarter of a phase to go before it hits the maximum 
value of x ), the force is already at its maximum. And when the particle makes it 
out to the maximum value of x, the force is already back to zero. 


2yco d 
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The fact the force is maximum when the particle is moving fastest makes sense 
from an energy point of view. If you want the amplitude to become large, then you 
will need to give the system as much energy as you can. That is, you must do as 
much work as possible on the system. And in order to do as much work as possible, 
you should have your force act over as large a distance as possible, which means 
that you should apply your force when the particle is moving fastest, that is, as it 
speeds past the origin. And similarly, you don’t want to waste your force when the 
particle is barely moving near the endpoints of its motion. 

In short, v is the derivative of x and therefore a quarter cycle ahead of x (which 
is a general property of a sinusoidal function, as you can show). Since we want the 
force to be in phase with v at resonance (by the above energy argument), we see 
that the force is also a quarter cycle ahead of x. 


The phase (f> 


Eq. (3.35) gives the phase of the motion as 


tan</> = 


2q u d 

/ ,2 _ , ,2 ’ 

W UJ d 


(3.38) 


where 0 < (j) < ir. Let’s look at a few cases for uid (not necessarily at resonance) and 
see what the resulting phase <fi is. Using eq. (3.38), we have: 


• Ud « 0 ==> <p ~ 0. This means that the motion is in phase with the force. 
Intuitively, the mass moves very slowly if a;^ ~ 0, so the motion basically 
just follows the force. A little more mathematically: Since there is essentially 
no acceleration, the net force is always essentially zero. This means that the 
driving force always essentially balances the spring force (that is, the two 
forces are 180° out of phase), because the damping force is negligible (since 
v ~ 0). But the spring force is 180° out of phase with the motion (because of 
the minus sign in F = —kx). Therefore, the driving force is in phase with the 
motion. 


• Ud ~ wo ==> 4> « it/ 2. This is the case of resonance, discussed above. 

• UJd ~ oo => (j) ~ 7T. This means that the motion is out of phase with the force. 
The reason for this is the following. If Ud ~ oo, then the mass moves back and 
forth very quickly. From eq. (3.37), we see that the amplitude is proportional 
to 1/a i d . It then follows that the velocity goes like 1/a;<*. Therefore, both x and 
v are always small; the mass hardly moves. But if x and v are always small, 
then the spring and damping forces can be ignored. So we basically have a 
mass that feels only one force, the driving force. But we already understand 
very well a situation where a mass is subject to only one oscillating force: a 
mass on a spring. Now, the mass can’t tell if it’s being driven by an oscillating 
driving force, or being pushed and pulled by an oscillating spring force. They 
both feel the same. Therefore, both phases must be the same. But in the 
spring case, the minus sign in F = —kx tells us that the force is 180° out of 
phase with the motion. Hence, the same result holds in the u>d ~ oo case. 
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3.5 Coupled oscillators 

In the previous sections, we have dealt with only one function of time, x(t). What 
if we have two functions of time, say x(t) and y{t). which are related by a pair of 
“coupled” differential equations? For example, we might have 

2x + u 2 {5x-3 y) = 0, 

2y + u 2 {5y-3x) = 0. (3.39) 

We’ll assume u> 2 > 0 here, but this isn’t necessary. We call these equations “coupled” 
because there are x’s and y's in both of them, and it is not immediately obvious how 
to separate them to solve for x and y. There are two methods (at least) of solving 
these equations. 

First method: Sometimes it is easy, as in this case, to find certain linear combi¬ 
nations of the given equations for which nice things happen. If we take the sum, we 
find 

(x + y) + u?(x + y) = 0. (3.40) 

This equation involves x and y only in the combination of their sum, x + y. With 
z = x + y, eq. (3.40) is just our old friend, z + uj 2 z = 0. The solution is 

x + y = Aicos(wt + fa), (3.41) 

where A] and <p\ are determined from initial conditions. We may also take the 
difference of eqs. (3.39). The result is 

(x - y) + £u 2 (x - y) = 0. (3.42) 

This equation involves x and y only in the combination of their difference, x — y. 
The solution is 

x — y = A 2 cos(2u )t + (f> 2 ), (3.43) 

Taking the sum and difference of eqs. (3.41) and (3.43), we find that x(t ) and y(t) 
are given by 


x(t) = Bi cos(u )t + (j) 1) + B 2 cos(2 ut + </> 2 ), 

y(t) = Bi cos(u )t + </>i) — B 2 cos(2c vt + </) 2 ), (3.44) 

where the £>,’s are half of the A, 's. 

The strategy of this solution was simply to fiddle around and try to form dif¬ 
ferential equations that involved only one combination of the variables, namely eqs. 
(3.40) and (3.42). This allowed us to write down the familiar solution for these 
combinations, as in eqs. (3.41) and (3.43). 

We’ve managed to solve our equations for x and y. However, the more interesting 
thing that we’ve done is produce the equations (3.41) and (3.43). The combinations 
(x + y) and (x — y) are called the normal coordinates of the system. These are the 
combinations that oscillate with one pure frequency. The motion of x and y will, in 
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general, look rather complicated, and it may be difficult to tell that the motion is 
really made up of just the two frequencies in eq. (3.44). But if you plot the values 
of ( x + y) and (x — y) as time goes by, for any motion of the system, then you will 
find nice sinusoidal graphs, even if x and y are each behaving in a rather unpleasant 
manner. 

Second method: In the above method, it was fairly easy to guess which combina¬ 
tions of eqs. (3.39) produced equations involving just one combination of x and y. 
eqs. (3.40) and (3.42). But surely there are problems in physics where the guessing 
isn’t so easy. What do we do then? Fortunately, there is a fail-proof method for 
solving for x and y. It proceeds as follows. 

In the spirit of Section 3.1, let us try a solution of the form x = Ae iat and 
y = Be' at . which we will write, for convenience, as 



(3.45) 


It is not obvious that there should exist solutions for x and y that have the same t 
dependence, but let’s try it and see what happens. Note that we’ve explicitly put 
the i in the exponent, but there’s no loss of generality here. If a happens to be 
imaginary, then the exponent is real. It’s personal preference whether or not you 
put the i in. 

Plugging our guess into eqs. (3.39), and dividing through by e lwt , we find 


2A(-a 2 ) + 5Aia 2 - 3Bv 2 = 0, 
2B(-a 2 ) + bBu 2 - 3Au 2 = 0, 

or equivalently, in matrix form, 


( —2a 2 + 5a; 2 —3a; 2 A 

(A) 

1 _ ( 0 ) 

y —3a; 2 —2a 2 + 5a; 2 J 


Ho 


(3.46) 


(3.47) 


This homogeneous equation for A and B has a nontrivial solution (that is, one where 
A and B aren’t both 0) only if the matrix is not invertible. This is true because if 
it were invertible, then we could simply multiply through by the inverse to obtain 
(A,B) = (0,0). 

When is a matrix invertible? There is a straightforward (although tedious) 
method for finding the inverse of a matrix. It involves taking cofactors, taking a 
transpose, and dividing by the determinant. The step that concerns us here is the 
division by the determinant. The inverse will exist if and only if the determinant is 
not zero. So we see that eq. (3.47) has a nontrivial solution only if the determinant 
equals zero. Because we seek a nontrivial solution, we must therefore have 

_ I —2a 2 + 5a; 2 —3 a; 2 

| —3a; 2 —2a 2 + 5a; 2 | 

= 4a 4 — 20a 2 a; 2 + 16a; 4 . 


(3.48) 
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The roots of this equation are a = ±cu and a = ±2 lu. We have therefore found 
four types of solutions. If a = +.w, then we can plug this back into eq. (3.47) 
to obtain A = B. (Both equations give this same result. This was essentially the 
point of setting the determinant equal to zero.) And if a = ±2u, then eq. (3.47) 
gives A = —B. (Again, the equations are redundant.) Note that we cannot solve 
specifically for A and B , but only for their ratio. Adding up our four solutions 
according to the principle of superposition, we see that x and y take the general 
form (written in vector form for the sake of simplicity and bookkeeping), 



+ A 3 ^ ^ ^ e 2iwt + A 4 ^ ^ e~ 2iu)t . (3.49) 

The four A* are determined from the initial conditions. 

We can rewrite eq. (3.49) in a somewhat cleaner form. If the coordinates x and 
y describe the positions of particles, they must be real. Therefore, A\ and A 2 must 
be complex conjugates, and likewise for A3 and A 4 . If we then define some 0’s and 
B' s via A% = A\ = (Bi/2)e l ^ >1 and A| = A 3 = (B 2 /2)e l( t >2 , we may rewrite our 
solution in the form, as you can verify, 



cos {ut + 01 ) + B 2 


'j ^ cos(2o+ + 0 2 ), 


(3.50) 


where the Bi and 0,; are real (and are determined from the initial conditions). We 
have therefore reproduced the result in eq. (3.44). 

It is clear from eq. (3.50) that the combinations x + y and x — y (the normal 
coordinates) oscillate with the pure frequencies, u and 2u. respectively. The com¬ 
bination x + y makes the B 2 terms disappear, and the combination x — y makes the 
B\ terms disappear. 

It is also clear that if B 2 = 0, then x = y at all times, and they both oscillate 
with frequency uj. And if B\ = 0, then x = —y at all times, and they both oscillate 
with frequency 2 u. These two pure-frequency motions are called the normal modes. 
They are labeled by the vectors (1,1) and (1,-1), respectively. In describing a 
normal mode, both the vector and the frequency should be stated. The significance 
of normal modes will become clear in the following example. 


k k k 
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Example (Two masses, three springs): Consider two masses, to, connected to Figure 3.9 

each other and to two walls by three springs, as shown in Fig. 3.9. The three springs 
have the same spring constant k. Find the positions of the masses as functions of 
time. What are the normal coordinates? What are the normal modes? 


Solution: Let x\ (f) and x 2 (t) be the positions of the left and right masses, respec¬ 
tively, relative to their equilibrium positions. Then the middle spring is stretched a 
distance x 2 — x-\. Therefore, the force on the left mass is —kx 1 + k(x 2 — aq), and 
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the force on the right mass is —kx 2 — k(x 2 — aq). It’s easy to make a mistake on the 
sign of the second term in these expressions. You can double check the sign by, say, 
looking at the force when x 2 is very big. At any rate, the second terms must have 
opposite signs in the two expressions, by Newton’s third law. 

With these forces, F = ma on each mass gives, with u 2 = k/m , 

x\ + 2cj 2 xi — u 2 x 2 = 0 , 

x 2 + 2u! 2 x 2 — u) 2 xi = 0. (3.51) 


These are rather friendly-looking coupled equations, and we can see that the sum and 
difference are the useful combinations to take. The sum gives 

(xi + X 2 ) + UJ 2 (X! + x 2 ) = 0, (3.52) 

and the difference gives 

(xi — x 2 ) + 3w 2 (xi — x 2 ) = 0. (3.53) 

The solutions to these equations are the normal coordinates, 

X 1 +X 2 = A + cos(u>t + </>+), 

X 1 — X 2 = A_ cos (VSwt + 4>-). (3.54) 

Taking the sum and difference of these normal coordinates, we have 

xi(t) = B + cos (ixt + <j) + ) + S_ cos (VSujt + </>_), 

x 2 (t) = B + cos(wf + (j) + ) — S_ cos(\/3wf + 0_), (3.55) 

where the S’s are half of the A’s. They are determined from the initial conditions, 
along with the 0’s. 


Remark: We can also derive eqs. (3.55) by using the determinant method. Letting 

xi = Ae lat and x 2 = Be lat in eqs. (3.51), we see that for there to be a nontrivial solution 
for A and B, we must have 



= a 4 - 4a V + 3u> 4 . (3.56) 

The roots of this equation are a = ±w and a = ±\/3w. If a = ±a>, then eq. (3.51) gives 
A = B. If a = ±\/3w, then eq. (3.51) gives A = —B. The solutions for x\ and x 2 therefore 


the general form 



(») - 

Ai ( ] ) e JU,t + A 2 ( ] ) e lwt 


\X2 ) 

\ 1 ) V 1 ) 

+ A 3 ^ ^ e ' /Siut + A 4 ( ^ ^ e-^ 



B + f | 'j cos(u>f + 0+) + R_ f ^ 'j cos(\/3wt + 0_). 

(3.57) 


This is equivalent to eq. (3.55). & 
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The normal modes are obtained by setting either B_ or B + equal to zero in eq. (3.55) 
or eq. (3.57). Therefore, the normal modes are (1,1) and (1, —1). How do we visualize 
these? 

The mode (1,1) oscillates with frequency u. In this case (where B_ = 0), we have 
x\ (f) = X 2 (f) = B + cos(ud + <t> + ) at all times. So the masses simply oscillate back 
and forth in the same manner, as shown in Fig. 3.10. It is clear that such motion 
has frequency ui, because as far as the masses are concerned, the middle spring is 
effectively not there, so each mass moves under the influence of only one spring, and 
therefore has frequency u>. 

The mode (1,-1) oscillates with frequency \/3lu. In this case (where B + = 0), we 
have xi (t) = —X 2 (t) = B_ cos(\/3 uit + 0_) at all times. So the masses oscillate back 
and forth with opposite displacements, as shown in Fig. 3.11. It is clear that this 
mode should have a frequency larger than that for the other mode, because the middle 
spring is stretched (or compressed), so the masses feel a larger force. But it takes a 
little thought to show that the frequency is \f3uj. A 


Remark: The normal mode (1,1) above is associated with the normal coordinate 
Xi+X 2 - They both involve the frequency ui. However, this association is not due to the fact 
that the coefficients of both x\ and x -2 in this normal coordinate are equal to 1. Rather, it is 
due to the fact that the other normal mode, namely ( X\,X 2 ) oc (1, —1), gives no contribution 
to the sum x\ + X 2 - 

There are a few too many l’s floating around in the above example, so it’s hard to see 
which results are meaningful and which results are coincidence. But the following example 
should clear things up. Let’s say we solved a problem using the determinant method, and 
we found the solution to be 

( y ) = ^1 ( 2 ) Cos ^ UJlt + ^ + -®2 ( J*5 ) COS (u) 2 t + (j>2). (3.58) 


|JLJ2_Q^LSLa*iUU| 
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Figure 3.10 
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Figure 3.11 


Then 5x + y is the normal coordinate associated with the normal mode (3,2), which has 
frequency uq. (This is true because there is no cos(u^t + <h) dependence in the combina¬ 
tion 5x + y.) And similarly, 2x — 3 y is the normal coordinate associated with the normal 
mode (1, —5), which has frequency x >2 (because there is no cos(uqt + (p \) dependence in the 
combination 2x — 3 y). 4k 

Another Remark: Note the difference between the types of differential equations 
we solved in the previous chapter in Section 2.3, and the types we solved throughout this 
chapter. The former dealt with forces that did not have to be linear in x or x, but which 
had to depend on only x, or only x, or only t. The latter dealt with forces that could depend 
on all three of these quantities, but which had to be linear in x and x. 4k 


3 If you want to obtain this y/3u> result without going through all of the above work, just note 
that the center of the middle spring doesn’t move. Therefore, it acts like two “half springs,” each 
with spring constant 2k (verify this). Hence, each mass is effectively attached to a “fc” spring and 
a “2fc” spring, yielding a total effective spring constant of 3k. Thus the C'i. 
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3.6 Exercises 

Section 3.1: Linear differential equations 

1 . kx force * 

A particle of mass m is subject to a force F{x) = kx. What is the most general 
form of x(t)‘! If the particle starts out at xq. what is the one special value of 
the initial velocity for which the particle doesn’t eventually get far away from 
the origin? 

2. Rope on a pulley ** 

A rope of length L and mass density pkg/m hangs over a massless pulley. 
Initially, the ends of the rope are a distance xo above and below their average 
position. The rope is given an initial speed. If you want the rope to not 
eventually fall off the pulley, what should this initial speed be? 

Section 3.2: Simple harmonic motion 

3. Amplitude 

Find the amplitude of the motion given by x(t) = C cos cat + D sin cut. 

4. Angled rails * 

Two particles of mass m are constrained to move along two horizontal rails 
that make an angle of 26 with respect to each other, as shown in Fig. 3.12. 
They are connected by a spring with spring constant k. What is the frequency 
of oscillations for the motion where the spring remains parallel to the position 
shown? 

5. Springs all over ** 

(a) A mass m is attached to two springs that have equilibrium lengths equal 
to zero. The other ends of the springs are fixed at two points (see 
Fig. 3.13). The two spring constants are equal. The mass sits at its 
equilibrium position and is then given a kick in an arbitrary direction. 
Describe the resulting motion. (Ignore gravity, although you actually 
don’t need to.) 

(b) A mass m is attached to a number of springs that have equilibrium lengths 
equal to zero. The other ends of the springs are fixed at various points in 
space (see Fig. 3.14). The spring constants are all the same. The mass 
sits at its equilibrium position and is then given a kick in an arbitrary 
direction. Describe the resulting motion. (Again, ignore gravity, although 
you actually don’t need to.) 

6. Removing a spring * 

The springs in Fig. 3.15 are at their natural equilibrium length. The mass 
oscillates along the line of the springs with amplitude d. At the moment (let 
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Figure 3.13 



Figure 3.14 



Figure 3.12 


Figure 3.15 
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this be t = 0 ) when the mass is at position x = d/2 (and moving to the right), 
the right spring is removed. What is the resulting x(t)‘! What is the amplitude 
of the new oscillation? 


7. Changing k ** 

Two springs each have spring constant k and relaxed length I. They are both 
stretched a distance l and attached to a mass m and two walls, as shown in 
Fig. 3.16. At a given instant, the right spring constant is somehow magically 
changed to 3k (the relaxed length remains I). At a time t = later, what 

is the mass’s position? What is its speed? 


8. Corrections to the pendulum *** 


(a) For small oscillations, the period of a pendulum is approximately T sa 
2tt y/Ifg, independent of the amplitude, 9q. For finite oscillations, show 
that the exact expression for T is 


T = 



dO 

V cos 9 — cos 6 q 


(3.59) 


(b) Find an approximation to this T, up to second order in 9q, in the following 
way. Make use of the identity cos 0 = 1—2 sin 2 (0/2) to write T in terms of 
sines (because it’s more convenient to work with quantities that go to zero 
as 9 —* 0). Then make the change of variables, sin x = sin(0/2)/ sin(0o/2). 
Finally, expand your integrand in powers of 9o, and perform the integrals 
to show that 4 

(3,6o) 
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Figure 3.16 


Section 3.3: Damped harmonic motion 

9. Crossing the origin 

Show that an overdamped or critically damped oscillator can cross the origin 
at most once. 

10. Strong damping * 

In the strong damping (7 a;) case discussed in the remark in the overdamp¬ 

ing subsection, we saw that x(t) oc t ^ 2y . Show that this can be written 
as x(t) oc e~ kt / b , where b is the coefficient of the damping force. And then 
explain, by looking at the forces on the mass, why this makes sense. 

11. Minimum speed * 

A critically damped oscillator with natural frequency u starts out at position 
xq. What is the minimum initial speed it must have if it is to cross the origin? 

4 If you like this sort of thing, you can show that the next term in the parentheses is (11/3072 )9q. 
But be careful, this fourth-order correction comes from two terms. 
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12. Another minimum speed ** 

An overdamped oscillator with natural frequency u and damping coefficient 7 
starts out at position xq. What is the minimum initial speed it must have if 
it is to cross the origin? 

13. Maximum speed ** 

A mass on the end of a spring is released from rest at position xq . The 
experiment is repeated, but now with the system immersed in a fluid that 
causes the motion to be critically damped. Show that the maximum speed of 
the mass in the first case is e times the maximum speed in the second case . 5 

14. Work * 

A damped oscillator has initial position xo and speed vq. After a long time, it 
will essentially be at rest at the origin. Therefore, by the work-energy theorem, 
the work done by the damping force must equal —Atcq/2 — rriv( ) /2. Verify that 
this is true. Hint: It’s a bit messy to find x in terms of the initial conditions 
and then calculate the desired integral. An easier way is to use the F = ma 
equation to rewrite one of the x’s in your integral. 

Section 3.4-' Driven (and damped) harmonic motion 

15. Resonance 

Given uj and 7 , show that the R in eq. (3.33) is minimum when uj c i = 
sjw 2 — 27 2 (unless this is imaginary, in which case the minimum occurs at 
w d = 0 ). 

16. No damping force * 

A particle of mass m is subject to a spring force, — kx, and also a driving force, 
Fdcosujdt. But there is no damping force. Find a solution for x(t) by guessing 
x(t) = A cos uj c jt + B sin u>dt. If you write your solution for x(t) in the form 
C cos((jj r jt — (j)), what are C and (jp. Be careful about the phase. 

17. No spring force * 

A particle of mass m is subject to a damping force, —bv, and also a driving 
force, Fd cos Udt. But there is no spring force. Find a solution for x(t) by 
guessing x{t) = A cos wfy + B sin If you write your solution for x(t') in the 
form C cos(uj dt — 4>), what are C and 0? 

Section 3.5: Coupled oscillators 

5 The fact that the maximum speeds differ by a fixed numerical factor follows from dimensional 
analysis, which tells us that the maximum speed in the first case must be proportional to uixq. 
And since 7 = u> in the critical-damping case, the damping doesn’t introduce a new parameter, 
so the maximum speed has no choice but to again be proportional to ojx 0. But showing that the 
maximum speeds differ by the nice factor of e requires a calculation. 
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18. Springs between walls ** 

Four identical springs and three identical masses lie between two walls (see 
Fig. 3.17). Find the normal modes. 

19. Springs and one wall ** 

Two identical springs and two identical masses are attached to a wall as shown 
in Fig. 3.18. Find the normal modes. 

20. Coupled and damped ** 

The system in the example in Section 3.5 is modified by immersing it in a 
fluid so that both masses feel a damping force, Ej = —bv. Solve for x\{t) and 
x 2 (t). 

21. Coupled and driven ** 

The system in the example in Section 3.5 is modified by subjecting the left 
mass to a driving force cos(2cji), and the right mass to a driving force 
2F^cos(2o;t), where oj = y/k/m. Find the particular solution for x\(t) and 


k k k k 
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Figure 3.17 
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Figure 3.18 
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3.7 Problems 

Section 3.1: Linear differential equations 

1. A limiting case * 

Consider the equation ic = ax. If a = 0, then the solution to x = 0 is of course 
x(t ) = C + Dt. Show that in the limit a —> 0. eq. (3.2) reduces to this form. 
Note: a —* 0 is a very sloppy way of saying what we mean. What is the proper 
way to write this limit? 

Section 3.2: Simple harmonic motion 

2. Average tension ** 

Is the average (over time) tension in the string of a pendulum larger or smaller 
than mg? By how much? As usual, assume that the angular amplitude A is 
small. 

Section 3.3: Damped harmonic motion 

3. Maximum speed ** 

A mass on the end of a spring (with natural frequency u) is released from 
rest at position xq. The experiment is repeated, but now with the system 
immersed in a fluid that causes the motion to be overdamped (with damping 
coefficient 7 ). Find the ratio of the maximum speed in the former case to that 
in the latter. What is the ratio in the limit of strong damping (7 3> to)? In 
the limit of critical damping? 

Section 3.4-' Driven (and damped) harmonic motion 

4. Exponential force * 

A particle of mass m is subject to a force F(t) = me~ bt . The initial position 
and speed are both zero. Find x(t). 6 

5. Driven oscillator = 1 = 

Derive eq. (3.31) by guessing a solution of the form x(t ) = A cos uff+B sin ujff 
in eq. (3.29). 

Section 3.ff Coupled oscillators 

6. Unequal masses ** 

Three identical springs and two masses, m and 2m, lie between two walls as 
shown in Fig. 3.19. Find the normal modes. 




Figure 3.19 


6 This problem was already given as Problem 2.9, but solve it here by guessing an exponential 
function, in the spirit of Section 3.4. 
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7. Driven mass on a circle ** 

Two identical masses m are constrained to move on a horizontal hoop. Two 
identical springs with spring constant k connect the masses and wrap around 
the hoop (see Fig. 3.20). One mass is subject to a driving force, F<i cos ujdt. 
Find the particular solution for the motion of the masses. 

8. Springs on a circle **** 

(a) Two identical masses m are constrained to move on a horizontal hoop. 
Two identical springs with spring constant k connect the masses and 
wrap around the hoop (see Fig. 3.21). Find the normal modes. 

(b) Three identical masses are constrained to move on a hoop. Three identical 
springs connect the masses and wrap around the hoop (see Fig. 3.22). 
Find the normal modes. 

(c) Now do the general case with N identical masses and N identical springs. 



Figure 3.20 



Figure 3.21 



Figure 3.22 
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3.8 Solutions 

1. A limiting case 

The expression “a —> 0” is sloppy, because a has units of [time] -2 , and the number 0 
has no units. The proper statement is that eq. (3.2) reduces to x(t) = C + Dt when 
yfat <C 1, or equivalently when t 1/y/a, which is now a comparison of quantities 
with the same units. The smaller a is, the larger t can be. Therefore, if “o —> 0,” 
then t can basically be anything. 

Under the condition y/at <C 1, we can write e ±x/at ~1± yjat. Therefore, eq. (3.2) 
becomes 

x (t) ~ A(1 + yfat) + B(1 — yfat) 

= (A + B ) + y/a(A — B)t 

= C + Dt. (3.61) 

If a is small but nonzero, then t will eventually become large enough so that fa t -C 1 
doesn’t hold, in which case the linear form in eq. (3.61) isn’t valid. 

2. Average tension 

Let the length of the pendulum be £. We know that the angle 9 depends on time 
according to 

9(t) = Acos(ut), (3.62) 

where ui = \fgji. If T is the tension in the string, then the radial F = ma equation 
is T — mg cos 6 = mlO 2 . Using eq. (3.62), this becomes 

T = mg cos ^Acos(wf)^ + mi(^ — wAsin (wf)) 2 . (3.63) 

Since A is small, we can use the small-angle approximation cos a « 1 — a 2 /2, which 
gives 


T M mg ^1 — ^A 2 cos 2 (a;f)^ + m£u 2 A 2 sin 2 (u;f) 

= mg + mg A 2 ^sin 2 (a;t) — ^ cos 2 (a;t)^ . (3.64) 

The average value of both sin 2 0 and cos 2 0 over one period is 1/2, 7 so the average 
value of T is 

T = mg + ^ (3.65) 


which is larger than mg, by mg A 2 /A. 


Remark: It makes sense that T > mg, because the average value of the vertical component 
of T equals mg (because the pendulum has no net rise or fall over a long period of time), and 
there is some non-zero contribution to the magnitude of T from the horizontal component. 

* 


7 You can show this by doing the integrals, or by noting that the averages are equal and that 
they add up to 1. 
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3. Maximum speed 

For the undamped case, the general form of x is x(t) = Ccos(ujt + 0). The initial 
condition u( 0 ) = 0 tells us that </> = 0 , and then the initial condition x(Q) = xo tells 
us that C = xo- Therefore, x(t) = xocos(u;t), and so v(t) = —u>xosin(o;t). This has 
a maximum magnitude of ujx o- 

Now consider the overdamped case. Eq. (3.18) gives the position as 

x(t) = Ae-^~ n » + Be-^+W. (3.66) 


The initial conditions are 


a:( 0 ) = xo => A + B = xo, 

u(0) = 0 => -( 7 -fi)A-( 7 +fi)B = 0. (3.67) 

Solving these equations for A and B, and then plugging the results into eq. (3.66), 
gives 

x (t ) = ^ ((7 + ft)e-^- n >‘ - ( 7 - 0)e-^+ n )‘) . (3.68) 

Taking the derivative to find v(t), and using j 2 — Q 2 = u> 2 , gives 

y (t) = (e-^¥ - e-(^) . (3.69) 


Taking the derivative one more time, we find that the maximum speed occurs at 

= < 3 ™> 


Plugging this into eq. (3.69), and taking advantage of the logs in the exponentials, 
gives 


v(t mi 


( 

Vr-n. 


)) 


h+n 




(3.71) 


The desired ratio, R, of the maximum speeds in the two scenarios is therefore 


R = 


-n) 


In the limit of strong damping (7 u>), we have ft = \J~t 2 — UJ 2 sa 7 — u, 

ratio becomes 

In the limit of critical damping (7 w w, fl « 0), we have, with fi /7 = e, 


R* 




l/2e 


a (1 + 2 e) 


l/2e 


(3.72) 
i /2'y. So the 

(3.73) 

(3.74) 


in agreement with the result of Exercise 13. You can also show that in these two 
limits, t max equals ln( 27 /w )/7 and I /7 sa 1 /w, respectively. 
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4. Exponential force 

Let’s guess a particular solution to x = e~ bt of the form x(t) = Ce~ bt . We find 
C = 1 /b 2 . And since the solution to the homogeneous equation x = 0 is x(t) = At+B, 
the complete solution for x is 

-6t 

x(t) = —+At + B. (3.75) 

The initial condition x((Y) = 0 gives B = —l/b 2 . And the initial condition u(0) = 0 
applied to v(t) = —e~ bt /b + A gives A = l/b. Therefore, 

<3 ' 76) 


5. Driven oscillator 

Plugging x(t) = A cos uj,[t + B sin uj ( jt into eq. (3.29) gives 

— uj^AcosuJdt — JdBs\nu>dt 

- 2 juJdA sin u> d t + 2ju> d B cos u)dt 

+ u> 2 Acosu) d t + u 2 Bsmu>dt = F cos u> d t- (3.77) 


If this is to be true for all t, the coefficients of cos oj r [t on both sides must be equal. 
And likewise for sinu^t. Therefore, 


— u>dA + 2q UdB + u> 2 A — F, 
— ijJ^B — 2 'yu>dA + u ) 2 B = 0. 


Solving this system of equations for A and B gives 

F(u> 2 -^) _ 2 F^ d 

( W 2 _ w 2) 2 + 472a ; 2 > ( w 2 _ w 2 )2 +472w2 


(3.78) 


(3.79) 


in agreement with eq. (3.31). 

6. Unequal masses 

Let xi and x- 2 be the positions of the left and right masses, respectively, relative to 
their equilibrium positions. The forces on the two masses are —kx\ + k(x 2 — x-\ ) and 
—kx 2 — k(x 2 — x\), respectively, so the F = mo equations are 


Xi + 2w 2 xi — CJ 2 X2 = 0 , 

2x 2 + 2uj 2 X2 — uj 2 x i = 0. (3.80) 


The appropriate linear combinations of these equations aren’t obvious, so we’ll use 
the determinant method. Letting x\ = A-\e lat and x 2 = A 2 e lat , we see that for there 
to be a nontrivial solution for A and B, we must have 


0 


— u) 2 —2a 2 

2a 4 — 6 a 2 u 2 + 3 w 4 . 


The roots of this equation are 




(3.81) 


a = 


and a = ±uh 


(3.82) 
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If a 2 = a\, then the normal mode is proportional to (\/3 + 1, —1). And if a 2 = a 2 , 
then the normal mode is proportional to (\/3 — 1,1). So the normal modes are 

(^2) = ( 1 ) C0S ( ai * + < M’ and 

(ll) = (^l -1 ) 008 (“ 2 *+ 02), (3.83) 

Note that these two vectors are not orthogonal (there is no need for them to be). 
You can show that the normal coordinates associated with these normal modes are 
X\ — (\/3 — l)a :2 and x\ + (\/3+ 1 ) 2 : 2 , respectively, because these are the combinations 
that make the 0:2 and aq frequencies disappear, respectively. 

7. Driven mass on a circle 

Label two diametrically opposite points as the equilibrium positions. Let the distances 
from the masses to these points be x\ and x -2 (measured counterclockwise). If the 
driving force acts on mass “1”, then the F = ma equations are 

mxi + 2k(x\ — X 2 ) = F d cos u>dt, 

mx 2 + 2 k(x 2 — x\) = 0. (3.84) 

To solve these equations, we can treat the driving force as the real part of F d e Wdt and 
try solutions of the form x\(t) = Aie Wdt and X 2 (t) = A^e WAt , and then solve for A\ 
and A 2 . Or we can try some trig functions. If we take the latter route, we will quickly 
find that the solutions can’t involve any sine terms (this is due to the fact that there 
are no first derivatives of the x’s in eq. (3.84)). Therefore, the trig functions must 
look like aq (t) = A 1 cos ui d t and X 2 (t) = A 2 cos ui,it. Using either of the two methods, 
eqs. (3.84) become 


—u d Ai + 2 u) 2 (A\ — A 2 ) = F, 

-u%A 2 + 2iv 2 (A 2 -A 1 ) = 0, (3.85) 

where ui = \Jk/rn and F = F d /m. Solving for A\ and A 2 , we find that the desired 
particular solution is 

—F(2u > 2 - w3) - 2 Fu 2 

Xl ^ = ^ 2 ( 4 ^ 2 -^) C0S 0>dt, X 2 ( t ) = ^2(4^2 _^2) cos “dt. (3.86) 

The most general solution is the sum of this particular solution and the homogeneous 
solution found in eq. (3.91) in Problem 8 below. 

Remarks: If Ud, = 2w, the amplitudes of the motions go to infinity. This makes sense, con¬ 
sidering that there is no damping, and that the natural frequency of the system (calculated 
in Problem 8) is 2ui. 

If ui,i = V2ui, then the mass that is being driven doesn’t move. What is going on here is 
that the driving force balances the force that the mass feels from the springs due to the 
other mass’s motion. And indeed, you can show that \/2uj is the frequency that one mass 
moves at if the other mass is at rest (and thereby acts essentially like a brick wall). Note 
that u>d = V2 cj is the cutoff between the masses moving in the same direction or in opposite 
directions. 

If u!d —> 00, then both motions go to zero. But x 2 fourth-order small, whereas xi is only 
second-order small. 
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If u>d —> 0, then Ai « A 2 « —F/2uj%. This is very large. The driving force basically spins 
the masses around in one direction, and then reverses and spins them around in the other 
direction. We essentially have the driving force acting on a mass 2 m , and two integrations 
of Fdcosuidt = (2 m)x shows that the amplitude of the motion is F/2uy^, as above. A 

8. Springs on a circle 

(a) Label two diametrically opposite points as the equilibrium positions. Let the 
distances from the masses to these points be x\ and X 2 (measured counterclock¬ 
wise) . Then the F = ma equations are 

mx i + 2 k(x\ — xT) = 0, 

mx 2 + 2k(x<i — xi) = 0. (3.87) 

The determinant method works here, but let’s just do it the easy way. Adding 
the equations gives 

*i + *2 = 0, (3.88) 

and subtracting them gives 

(ah — X 2 ) + 4w 2 (a:i — X 2 ) = 0. (3.89) 


The normal coordinates are therefore 

* 1 + 2:2 = At + B , 

* 1 — 2:2 = Ccos(2u;t + 4>). (3.90) 

Solving these two equations for x\ and £ 2 , and writing the results in vector form, 
gives 

( *1 ^ ^ j ^ (At+ B) + 0 ^ ^ cos(2 ut + <f), (3.91) 

where the constants A, B, and C are defined to be half of what they were in eq. 
(3.90). The normal modes are therefore 

(*:) = (!)<^+ b ). ^ 

[ll) = C ( )cos(2^ + #|, (3.92) 

The first mode has frequency zero. It corresponds to the masses sliding around 
the circle, equally spaced, at constant speed. The second mode has both masses 
moving to the left, then both to the right, back and forth. 

(b) Label three equally spaced points as the equilibrium positions. Let the distances 
from the masses to these points be x\, * 2 , and £3 (measured counterclockwise). 
Then the F = ma equations are, as you can show, 

m®i^fc(*i — *2) + k(x 1 — £3) = 0 , 

mx 2 + k(x 2 — £3) + k(x 2 — £1) = 0, 

rn£ 3 + k(x 3 — £1) + k(x 3 — £2) = 0. (3.93) 

The sum of all three of these equations definitely gives something nice. Also, 

differences between any two of the equations gives something useful. But let’s 
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use the determinant method to get some practice. Trying solutions of the form 
x\ = A-f e lat , X'i = A 2 e iat , and x- ti = A 3 e* at , we obtain the matrix equation, 



Setting the determinant equal to zero yields a cubic equation in a 2 . But it is a 
nice cubic equation, with a 2 = 0 as a solution. The other solution is the double 
root a 2 = 3w 2 . 

The a = 0 root corresponds to A\ = A 2 = A 3 . That is, it corresponds to the 
vector (1,1,1). This a = 0 case is the one case where our exponential solution 
isn’t really an exponential. But a 2 equalling zero in eq. (3.94) basically tells us 
that we are dealing with a function whose second derivative is zero, that is, a 
linear function At + B. Therefore, the normal mode is 

^ x 2 j = ^ 1 j (At + B). (3.95) 


This mode has frequency zero, and corresponds to the masses sliding around the 
circle, equally spaced, at constant speed. 

The two a 2 = 3 u> 2 roots correspond to a two-dimensional subspace of normal 
modes. You can show that any vector of the form (a, b, c) with a + b + c = 0 
is a normal mode with frequency \/3u>. We will arbitrarily pick the vectors 
(0,1,—1) and (1,0,—1) as basis vectors for this space. We can then write the 
normal modes as linear combinations of the vectors 


I cos(v / 3 ut + 0i), 
= C 2 i 0 I cos(\/3u;t + (j> 2 ). 


(3.96) 


Remarks: This is very similar to the example in Section 3.5 with two masses and 
three springs oscillating between two walls. The way we’ve written these modes, the 
first one has the first mass stationary (so there could be a wall there, for all the other 
two masses know). Similarly for the second mode. Hence the \/?jlo result here, as in 
the example. 

The normal coordinates in this problem are *1+0:2+ *3 (obtained by adding the 
three equations in (3.93)), and also any combination of the form ax 1 + bx 2 + CX3, 
where a + b + c = 0 (obtained by taking a times the first eq. in (3.93), plus b times 
the second, plus c times the third). The three normal coordinates that correspond to 
the mode in eq. (3.95) and the two modes we chose in eq. (3.96) are, respectively, 
X1+X2 + *3, —2*i + *2 + *3, and xi — 2* 2 + *3, because each of these combinations 
gets no contribution from the other two modes. Jft 

(c) In part (b), when we set the determinant of the matrix in eq. (3.94) equal to 
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zero, we were essentially finding the eigenvectors and eigenvalues 8 of the matrix, 



1 1 
1 1 
1 1 


(3.97) 


We haven’t bothered writing the common factor uj‘ 2 here, because it doesn’t 
affect the eigenvectors. As an exercise, you can show that for the general case of 
N springs and masses on a circle, the above matrix becomes the N x N matrix, 


1 1 
0 1 


1 0 0 
0 


\ i o o o 


1 / 


(3.98) 


In the matrix M, the three consecutive l’s keep shifting to the right, and they 
wrap around cyclicly. We must now find the eigenvectors of M, which will 
require being a little clever. 

We can guess the eigenvectors and eigenvalues of M if we take a hint from its 
cyclic nature. A particular set of things that are rather cyclic are the Nth roots 
of 1. If T] is an Nth root of 1, you can verify that ( 1 , r/, rf ...., r/ ,v_ l ) is an 
eigenvector of M with eigenvalue ry -1 + 1 + rj. (This general method works for 
any matrix where the entries keep shifting to the right. The entries don’t have 
to be equal.) The eigenvalues of the entire matrix in eq. (3.98) are therefore 
3 - (77“ 1 + 1 + rj) = 2 - ?y _1 - ?y. 

There are N different Nth roots of 1, namely r/ n = e 2nm / N , for 0 < n < N. So 
the N eigenvalues are 


^ =2 27T in/N _|_ g27rin/iV^ 


a -2 cos (27m/Af) 

I sin 2 (nn/N). (3.99) 


The corresponding eigenvectors are 

Vn=(l, Vn ,V 2 n,...^- 1 ). 


(3.100) 


Since the numbers n and N — n yield the same value for \ n in eq. (3.99), 
the eigenvalues come in pairs (except for n = 0, and n = N/2 if N is even). 
This is fortunate, because we may then form real linear combinations of the 
two corresponding complex eigenvectors given in eq. (3.100). We see that the 
vectors 


/ 


V+ = *(V n + V N _ n ) = 


1 \ 

cos(2nn/N) 
cos ( 47 vn/N) 


(3.101) 


\ cos(2 (N — l)nn/N) ) 


®An eigenvector v of a matrix M is a vector that gets taken into a multiple of itself when acted 
upon by M. That is, Mv = Xv, where A is some number (the eigenvalue). This can be rewritten as 
(M — A I)v = 0, where I is the identity matrix. By our usual reasoning about invertible matrices, 
a nonzero vector v exists only if A satisfies det | M — A/| = 0. 
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and 


v \V n -V N _ n ) 


( 0 \ 
sin(27 m/N) 
sin(47m/./V) 


(3.102) 


\ sin(2(lV — l)nn/N) ) 


both have eigenvalue \ n = Xn-h- Referring back to the N = 3 case in eq. (3.94), 
we see that we must take the square root of the eigenvalues and then multiply by 
uj to obtain the frequencies (because it was an a 2 that appeared in the matrix, 
and because we dropped the factor of uj 2 ). The frequencies corresponding to the 
above two normal modes are therefore, using eq. (3.99), 


u> n = u)\fXn = 2ujsm(-jrn/N). (3.103) 


Remark: Let’s check our results for N = 3. If n = 0, we find Ao = 0, and Vo = 
(1,1,1). If n= 1, we find Ai = 3, and V+ = (1,-1/2,-1/2) and Vf = (0,1/2,-1/2). 
These two vectors span the same space we found in eq. (3.96). And \/Ai = \/3, in 
agreement with eq. (3.96). You can also find the vectors for N = 4. These are fairly 
intuitive, so try to write them down first without using the above results. 
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Chapter 4 

Conservation of Energy and 
Momentum 

Copyright 2004 by David Morin, morin@physics.harvard.edu 


Conservation laws are extremely important in physics. They are enormously helpful, 
both quantitatively and qualitatively, in figuring out what is going on in a physical 
system. 

When we say that something is “conserved”, we mean that it is constant over 
time. If a certain quantity is conserved, for example, while a ball rolls around on 
a hill, or while a group of particles interact, then the possible final motions are 
greatly restricted. If we can write down enough conserved quantities (which we are 
generally able to do, at least for the problems in this book), then we can restrict 
the final motions down to just one possibility, and so we have solved our problem. 
Conservation of energy and momentum are two of the main conservation laws in 
physics. A third, conservation of angular momentum, is discussed in Chapters 6-8. 

It should be noted that it is not necessary to use conservation of energy and 
momentum when solving a problem. We will derive these conservation laws from 
Newton’s laws. Therefore, if you felt like it, you could always simply start with 
first principles and use F = ma, etc. You would, however, soon grow weary of this 
approach. The point of conservation laws is that they make your life easier, and they 
provide a means for getting a good idea of the overall behavior of a given system. 


4.1 Conservation of energy in 1-D 

Consider a force, in just one dimension for now, that depends only on position. That 
is, F = F(x). If we write a as vdv/dx , then F = ma becomes 

= F(x). (4.1) 

dx 

Separating variables and integrating gives mv 2 /2 = E + R F(x') dx ', where E is a 
constant of integration, dependent on the choice of xq. (We’re simply following the 
procedure in Section 2.3 here, for a function that depends only on x.) If we now 
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define the potential energy , V(x), as 


V(x) = — J F{x') dx', 

(4.2) 

then we may write 


^mw 2 + V(x) = E. 

(4.3) 


We define the first term here to be the kinetic energy. Since this equation is true 
at all points in the particle’s motion, the sum of the kinetic energy and potential 
energy is a constant. If a particle loses (or gains) potential energy, then its speed 
increases (or decreases). 

In Boston, lived Jack as did Jill, 

Who gained mgh on a hill. 

In their liquid pursuit, 

Jill exclaimed with a hoot, 

“I think we’ve just climbed a landfill!” 


While noting, “Oh, this is just grand,” 
Jack tripped on some trash in the sand. 
He changed his potential 
To kinetic, torrential, 

But not before grabbing Jill’s hand. 


Both E and V(x) depend, of course, on the arbitrary choice of x'o in eq. (4.2). 
What this means is that E and V(x) have no meaning by themselves. Only differ¬ 
ences in E and V(x) are relevant, because these differences are independent of the 
choice of xo . For example, it makes no sense to say that the gravitational poten¬ 
tial energy of an object at height y equals — J F dy = —f(—mg)dy = mgy. We 
have to say that mgy is the potential energy with respect to the ground (if your 
xo is at ground level). If we wanted to, we could say that the potential energy is 
mgy + 7mg with respect to a point 7 meters below the ground. This is perfectly 
correct, although a little unconventional. 1 

If we take the difference between eq. (4.3) evaluated at two points, x\ and x 2 , 
then we obtain 


1 

2 


V( Xl ) - V(x 2 ) 

/ X2 

F{x') dx'. 


(4.4) 


Here it is clear that only differences in energies matter. If we define the integral 
here to be the work done on the particle as it moves from x\ to x 2 , then we have 
produced the work-energy theorem, 

1 It gets to be a pain to keep repeating “with respect to the ground.” Therefore, whenever 
anyone talks about gravitational potential energy in an experiment on the surface of the earth, it 
is understood that the ground is the reference point. If, on the other hand, the experiment reaches 
out to distances far from the earth, then r = oo is understood to be the reference point, for reasons 
of convenience we will shortly see. 
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Theorem 4.1 The change in a particle’s kinetic energy between points xi and xi 
is equal to the work done on the particle between x\ and X 2 - 

If the force points in the same direction as the motion (that is, if the F(x) and 
the dx in eq. (4.4) have the same sign), then the work is positive and the speed 
increases. If the force points in the direction opposite to the motion, then the work 
is negative and the speed decreases. 

Having chosen a reference point xq for the potential energy, if we draw the V(x) 
curve and also the constant E line (see Fig. 4.1), then the difference between them 
gives the kinetic energy. The places where V(x ) > E are the regions where the 
particle cannot go. The places where V(x) = E are the “turning points” where the 
particle stops and changes direction. In the figure, the particle is trapped between 
x\ and x' 2 , and oscillates back and forth. The potential V(x) is extremely useful 
this way, because it makes clear the general properties of the motion. 

Remark: It may seem silly to introduce a specific xo as a reference point, considering 
that it is only eq. (4.4) (which makes no mention of Xo ) that has any meaning. It’s sort of 
like taking the difference between 17 and 8 by first finding their sizes relative to 5, namely 
12 and 3, and then subtracting 3 from 12 to obtain 9. However, since integrals are harder 
to do than simple subtractions, it is advantageous to do the integral once and for all and 
thereby label all positions with a definite number V{x), and to then take differences between 
the F’s when needed. 4 

Note that eq. (4.2) implies 


F(x ) = - 


dV(x) 

dx 


(4.5) 


Given V(x), it is easy to take its derivative to obtain F{x). But given F(x), it may 
be difficult (or impossible) to perform the integration in eq. (4.2) and write V(x) 
in closed form. But this is not of much concern. The function V(x) is well-defined 
(assuming that the force is a function of x only), and if needed it can be computed 
numerically to any desired accuracy. 


V(x) 



I *1 *2 

Figure 4.1 


Example 1 (Gravitational potential energy): Consider two point masses, M 
and m, separated by a distance r. Newton’s law of gravitation says that the force 
between them is attractive and has magnitude GMm/r 2 . The potential energy of the 
system at separation r, measured relative to separation ro, is 




-GMm GMm 


(4.6) 


A convenient choice for ro is oo, because this makes the second term vanish. It will be 
understood from now on that this ro = oo reference point has been chosen. Therefore 
(see Fig. 4.2), 


V(r) = 


-GMm 


(4.7) 



Figure 4.2 




IV-4 


CHAPTER 4. CONSERVATION OF ENERGY AND MOMENTUM 


Example 2 (Gravity near the earth): What is the gravitational potential energy 
of a mass m at height y, relative to the ground? We know, of course, that it is mgy, 
but let’s do it the hard way. If M is the mass of the earth and R is its radius, then 
(assuming y <C R) 


V(R + y)-V(R) 


—GMm —GMm 
R + y R 
-GMm ( 1 \ 

R \l + y/R ) 

GMmy 
R 2 ’ 


(4.8) 


where we have used the Taylor series approximation for 1/(1 + e) to obtain the third 
line. We have also used the fact that a sphere can be treated like a point mass, as far 
as gravity is concerned. We’ll prove this in Section 4.4.1. 

Using g = GM/R 2 , we see that the potential energy difference in eq. (4.8) equals 
mgy. We have, of course, simply gone around in circles here. We integrated in eq. 
(4.6), and then we basically differentiated in eq. (4.8) by taking the difference between 
the forces. But it’s good to check that everything works out. 


Remark: A good way to visualize a potential V(x ) is to imagine a ball sliding around 
in a valley or on a hill. For example, the potential of a typical spring is V(x) = kx 2 /2 (which 
produces the Hooke’s-law force, F(x) = —dV/dx = —kx), and we can get a decent idea of 
what is going on if we imagine a valley with height given by y = x 2 /2. The gravitational 
potential of the ball is then mgy = mgx 2 / 2. Choosing mg = k gives the desired potential. 
If we then look at the projection of the ball’s motion on the a;-axis, it seems like we have 
constructed a setup identical to the original spring. 

However, although this analogy helps in visualizing the basic properties of the motion, 
the two setups are not the same. The details of this fact are left for Problem 5, but the 
following observation should convince you that they are indeed different. Let the ball be 
released from rest in both setups at a large value of x. Then the force, kx, due to the spring 
is very large. But the force in the ^-direction on the particle in the valley is only a fraction 
of mg (namely mg sin 0 cos 6 , where 0 is the angle of the ground). X 

Conservative forces 

Given any force (it can depend on x, v, t, and/or whatever), the work it does on a 
particle is defined by W = f F dx. If the particle starts at x\ and ends up at x^, 
then no matter how it gets there (it can speed up or slow down, or reverse direction 
a few times, perhaps due to the influence of another force), we can calculate the 
work done by the given force and equate the result with the change in kinetic energy, 



For some forces, the work done is independent of how the particle moves. A 
force that depends only on position (in one dimension) has this property, because 
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the integral in eq. (4.4) depends only on the endpoints. The W = f F dx integral is 
simply the area under the F vs. x graph, and this area is independent of how the 
particle goes from x\ to x' 2 - 

For other forces, the work done depends on how the particle moves. Such is the 
case for forces that depend on t or v, because it then matters when or how quickly 
the particle goes from x\ to x' 2 - An common example of such a force is friction. If 
you slide a brick across a table from x\ to x' 2 , then the work done by friction equals 
— gmg\Ax\. But if you slide the brick by wiggling it back and forth for an hour 
before you finally reach X 2 , then the amount of negative work done by friction will 
be very large. Since friction always opposes the motion, the contributions to the 
W = f F dx integral are always negative, so there is never any cancellation. The 
result is therefore a large negative number. 

The issue with friction is that the gmg force isn’t a function only of position, 
because at a given location the friction can point to the right or to the left, depending 
on which way the particle is moving. Friction is therefore a function of velocity. 
True, it’s a function only of the sign of the velocity, but that’s enough to ruin the 
position-only dependence. 

We now define a conservative force as one for which the work done on a particle 
between two given points is independent of how the particle makes the journey. From 
the preceding discussion, we know that a one-dimensional force is conservative if and 
only if it depends only on x (or is constant). 2 

The point we’re leading up to here is that although we can define the work done 
by any force, we can only talk about potential energy associated with a force if the 
force is conservative. This is true because we want to be able to label each value 
of x with a unique number, V(x), given by V(x) = — f* g F dx. If this integral were 
dependent on the path, then it wouldn’t be well-defined, so we wouldn’t know what 
number to assign to V(x). We therefore talk only about potential energies that are 
associated with conservative forces. In particular, it makes no sense to talk about 
the potential energy associated with a friction force. 

Work vs. potential energy 

When you drop a ball, does its speed increase because the gravitational force is 
doing work on it, or because its gravitational potential energy is decreasing? Well, 
both (or more precisely, either). Work and potential energy are two different ways 
of talking about the same thing (at least for conservative forces). Either method of 
reasoning will give the correct result. However, be careful not to use both reasonings 
and “double count” the effect of gravity on the ball. 

Which terminology you use depends on what you call your “system”. Just as 
with F = ma and free-body diagrams, it is important to label your system when 
dealing with work and energy. 

The work-energy theorem stated in Theorem 4.1 was relevant to one particle. 
What if we are dealing with the work done on a system that is composed of various 

2 In two or three dimensions, however, we will see in Section 4.3.1 that a conservative force must 
satisfy another requirement, in addition to being dependent only on position. 
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parts? The general work-energy theorem says that the work done on a system by 
external forces equals the change in energy of the system. This energy many come 
in the form of kinetic energy, or internal potential energy, or heat (which is really 
just kinetic energy). For a point particle, there is no internal structure (so we’ll 
assume it can’t heat up), so this general form of the theorem reduces to Theorem 
4.1. But to see what happens when a system has internal structure, consider the 
following example. 


Example (Raising a book): You lift a book up at constant speed, so there is no 
change in kinetic energy. Let’s see what the general work-energy theorem says for 
various choices of the system. 

• System = (book): Both you and gravity are external forces, and there is no 
change in energy of the book as a system in itself. So the W-E theorem says 

W you + W grav = 0 <^=> mgh + (- mgh ) = 0. (4.10) 

• System = (book + earth): Now you are the only external force. The gravita¬ 
tional force between the earth and the book is an internal force which produces 
and internal potential energy. So the W-E theorem says 

Wyou = AFearth-book <*==> rngh = mgh. (4-11) 

• System = (book + earth + you): There is now no external force. The internal 
energy of the system changes because the earth-book gravitational potential 
energy increases, and also because your potential energy decreases. In order to 
lift the book, you have to burn some calories from the dinner you ate. So the 
W-E theorem says 

0 = AF earth _ bo ok + AVyou 0 = mgh + (-mgh). (4.12) 

Actually, a human body isn’t 100% efficient, so what really happens here is that 
your potential energy decreases by more than mgh, but heat is produced. The 
sum of these two changes in energy equals —mgh. 


The moral of all this is that you can look at a setup in various ways. Potential 
energy in one way might be work in another. In practice, it is usually more con¬ 
venient to work in terms of potential energy. So for a dropped ball, people usually 
consider gravity to be an internal force in the earth-ball system, as opposed to an 
external force on the ball system. 


4.2 Small Oscillations 

Consider an object in one dimension, subject to the potential V(x). Let the object 
initially be at rest at a local minimum of V(x ), and then let it be given a small kick 
so that it moves back and forth around the equilibrium point. What can we say 
about this motion? Is it harmonic? Does the frequency depend on the amplitude? 
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It turns out that for small amplitudes, the motion is indeed simple harmonic 
motion, and the frequency can easily be found, given V(x). To see this, expand 
V(x ) in a Taylor series around the equilibrium point, xo- 

V(x) = V(x 0 )+V'(xo){x-xo)+^V"{xo)(x-xo) 2 +^V"'(xo)(x-x 0 ) 3 +■ ■ ■ . (4.13) 

This looks like a bit of a mess, but we can simplify it greatly. V(xq) is an 
irrelevant additive constant. We can ignore it because only differences in energy 
matter (or equivalently, because F = —dV/dx). And V(xq) = 0, by definition of the 
equilibrium point. So that leaves us with the V"(xq) and higher-order terms. But for 
sufficiently small displacements, these higher-order terms are negligible compared 
to the V"(xq) term, because they are suppressed by additional powers of (x — xq). 
So we are left with 3 

V(x) fa ^V"{x 0 ){x - x 0 ) 2 . (4.14) 

But this looks exactly like the Hooke’s-law potential, V(x) = (l/2)k(x — xq) 2 . pro¬ 
vided that we let V"{xq) be our “spring constant,” k. The frequency of small 
oscillations, u = y/kjm, therefore equals 


Example: A particle moves under the influence of the potential V(x) = A/x 2 — B jx. 
Find the frequency of small oscillations around the equilibrium point. This potential 
is relevant to planetary motion, as we will see in Chapter 6. 


Solution: The first thing 
have 


need to do is calculate the equilibrium point, xq. We 

9 A R 


Therefore, V'(x) = 0 when x = 2 A/B = xq■ The second derivative of V(x) is 


V"{x) = 


6A 


2 B 

~^3 ■ 


Plugging in xq = 2 A/B, we find 


(4.17) 


(4.18) 


Eq. (4.15) is an important result, because any function V(x) looks basically like 
a parabola (see Fig. 4.3) in a small enough region around a minimum (except in 
the special case where V"(xq) = 0). 

3 Even if V'"(xo) is much larger than V"(xo), we can always pick (x — xo) small enough so that 
the V"'(xo) term is negligible. The one case where this is not true is when V"(xo) == T. But the 
result in eq. (4.15) is still valid in this case. The frequency w just happens to be zero. 



Figure 4.3 
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A potential may look quite erratic, 

And its study may seem problematic. 

But down near a min, 

You can say with a grin, 

“It behaves like a simple quadratic!” 

4.3 Conservation of energy in 3-D 

The concepts of work and potential energy in three dimensions are slightly more 
complicated than in one dimension, but the general ideas are the same. As in the 1-D 
case, we start with Newton’s second law, which now takes the vector form, F = ma. 
And as in the 1-D case, we will deal only with forces that depend only on position, 
that is, F = F(r). This vector equation is shorthand for three equations analogous 
to eq. (4.1), namely rnv x (dv x /dx) = F x . and likewise for y and z. Multiplying 
through by dx, etc., in these three equations, and then adding them together gives 

F x dx + Fydy + F z dz = m(v x dv x + v y dv y + v z dv z ). (4-19) 

Integrating from the point (xo, yo; A)) to the point ( x,y,z ) yields 

E+ J F x dx + J Fydy-\- J F z dz = )^m(v 2 + v 2 + v 2 ) = ^mv 2 , (4.20) 

where E is a constant of integration. 4 Note that the integrations on the left-hand 
side depend on what path in 3-D space the particle takes in going from (xo,yo,zo) 
to ( x,y,z ). We will address this issue below. 

With dr = (dx, dy, dz), the left-hand side of eq. (4.19) is equal to F • dr. Hence, 
eq. (4.20) may be written as 

\mv 2 - r F(r') • dr' = E. (4.21) 

2 J r 0 

Therefore, if we define the potential energy, F(r), as 

F(r) = — f T F(r') • dr', (4.22) 

J ro 

then we may write 

^mv 2 + F(r) = E. (4.23) 

In other words, the sum of the kinetic energy and potential energy is constant. 

technically, we should put primes on the integration variables so that we don’t confuse them 
with the limits of integration, but this gets too messy. 
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4.3.1 Conservative forces in 3-D 

For a force that depends only on position (as we have been assuming), there is one 
complication that arises in 3-D that we didn’t have to worry about in 1-D. In 1- 
D, there is only one route that goes from xo to x. The motion itself may involve 
speeding up or slowing down, or backtracking, but the path is always restricted to 
be along the line containing xo and x. But in 3-D, there is an infinite number of 
routes that go from ro to r. In order for the potential, V(r ), to have any meaning 
and to be of any use, it must be well-defined. That is, it must be path-independent. 
As in the 1-D case, we call the force associated with such a potential a conservative 
force. Let’s now see what types of 3-D forces are conservative. 


Theorem 4.2 Given a force F(r), a necessary and sufficient condition for the po¬ 
tential, 

V (r) = - / F(r') • dr', (4.24) 

J ro 

to be well-defined (that is, to be path-independent) is that the curl of F is zero (that 
is, V x F = Oj. 5 

Proof: First, let us show that V x F = 0 is a necessary condition for path- 
independence. In other words, “If F(r) is path-independent, then V x F = 0.” 

Consider the infinitesimal rectangle shown in Fig. 4.4. This rectangle lies 
in the x-y plane, so in the present analysis we will suppress the ^-component of 
all coordinates, for convenience. If the potential is path-independent, then the 
work done in going from (X,Y) to ( X + dX,Y + dY), which equals the integral 
fF-dr, must be path-independent. In particular, the integral along the segments 
“1” and “2” must equal the integral along the segments “3” and “4”. That is, 
J] F y dy + f 2 F x dx = f 3 F x dx + f 4 F y dy. Therefore, a necessary condition for path- 
independence is 


t 


® 


{Xx dX, Y+dY) 

© J 
© 


| (X Y) 


© 


Figure 4.4 


J F x dx — J F x dx = J Fydy — J F y dy ==> 

rX+dX 

Jx [F x (x, Y + dY)- F x (x, Y) J dx 

= £^ (F y (X + dX, y) - F y (X , y)) dy. 


F x (x, Y + dY)- F x (x, Y) to dY 


dF x (x,y) 

dy 


_ d F x (x,y) 

e,v) ~ Q y ( 


(4.25) 

(4.26) 


The first approximation holds due to the definition of the partial derivative. The 
second approximation holds because our rectangle is small enough so that x is 

5 If you haven’t seen curl before, it’s defined below in eq. (4.30). But there is actually no need to 
be familiar with the definition of curl here, because it is, after all, just a definition. The important 
result that we will be deriving is the equality to the right of the sign in eq. (4.30). 
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Figure 4.5 


essentially equal to X. Any errors due to this approximation will be second-order 
small, because we already have one factor of dY in our term. 

A similar treatment works for the F y terms, so eq. (4.25) becomes 


dy 


\(X,Y) 


dy. 


\{X,Y) 


(4.27) 


The integrands here are constants, so we can quickly perform the integrals to obtain 


fdF x (x,y) dF y (x,y)\ 


dy 


dx 


(4.28) 


Cancelling the dXdY factor, and noting that (X. Y) is an arbitrary point, we see 
that if the potential is path-independent, then we must have 


dF x (x,y ) _ dF y (x,y) 
dy dx 


(4.29) 


at any point (x. y). 

The preceding analysis also works, of course, for little rectangles in the x-z and 
y-z planes. We therefore obtain two other analogous conditions for the potential to 
be well-defined. All three conditions may be concisely written as 


/ dF z _ dF y dF x _dF\ dFy _ dF x \ 
V dy dz ’ dz dx ’ dx dy ) 


(4.30) 


We have therefore shown that V x F = 0 is a necessary condition for path indepen¬ 
dence. Let us now show that it is sufficient. In other words, “If V x F = 0, then 
F(r) is path-independent.” 

The proof of sufficiency follows immediately from Stokes’ theorem (but see the 
remark below for another proof), which states that (see Fig. 4.5) 


£ F(r) • dr = J(V x F) • dA. (4.31) 

Here, C is an arbitrary closed curve, which we make pass through ro and r. S' is an 
arbitrary surface that has C as its boundary. And dA has a magnitude equal to an 
infinitesimal piece of area on S and a direction defined to be orthogonal to S. 

Eq. (4.31) implies that if V x F = 0 everywhere, then f c F(r) • dr = 0 for any 
closed curve. But Fig. 4.5 shows that traversing the loop C entails traversing path 
“1” in the “forward” direction, and then traversing path “2” in the “backward” 
direction. Hence, F • dr — f 2 F • dr = 0, where both integrals run from ro to r. 
Therefore, any two paths from r 0 to r give the same integral, as we wanted to show. 


Remarks: 

1. If you don’t like invoking Stokes’ theorem, then you can just back up a step and 
prove it from scratch. Here’s the rough idea of the proof. For simplicity, pick a path 
confined to the x-y plane (the general case proceeds in the same manner). For the 
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purposes of the dx and dy integrations, any path can be approximated by a series of 
little segments parallel to the coordinate axes (see Fig. 4.6). 

Now imagine integrating f F • dr over every little rectangle in the figure (in a coun¬ 
terclockwise direction). The result can be viewed in two ways: (1) From the above 
analysis leading to eq. (4.28), each integral gives the curl times the area of the rectan¬ 
gle. So whole integral gives f s (V x F) dA. (2) Each interior line gets counted twice (in 
opposite directions) in the whole integration, so these contributions cancel. We are 
therefore left with the integral over only the edge segments, which gives § c F(r) • dr. 
Equating these two ways of looking at the integration gives Stokes’ theorem in eq. 
(4.31). 

2. Another way to show that V x F = 0 is a necessary condition for path-independence 
(that is, “If V(r) is path-independent, then V x F = 0.”) is the following. If V(r) 
is path-independent (and therefore well-defined), then it is legal to write down the 
differential form of eq. (4.22). This is 


dV( r) = —F(r) • dr = ~{F X dx + F y dy + F z dz). (4.32) 


But another expression for dV is 


(4.33) 


The previous two equations must be equivalent for arbitrary dx , dy, and dz. So we 
have 


(F x ,F y ,F z ) 


F(r) 


_fdV dV dV\ 
V dx ' dy' dz ) 
-W(r). 


(4.34) 


In other words, the force is simply the gradient of the potential. Therefore, 


V x F = -V x W(r) = 0, (4.35) 

because the curl of a gradient is identically zero, as you can explicitly verify. & 



Figure 4.6 


Example (Central force): A central force is defined to be a force that points 
radially and whose magnitude depends only on r. That is, F(r) = F(r) r. Show that 
a central force is a conservative force by explicitly showing that V x F = 0. 

Solution: F may be written as 

F (x,y,z) = F(r)r = F(r) (* 

Now, as you can verify, 

dr _ d\/x 2 +y 2 + z 2 _ 
dx dx 

and similarly for y and 2 . Therefore, the 2 component of V x F equals (writing F for 
F(r), and F' for dF(r)/dr) 

dFy_dF x _ d(yF/r) d(xF/r) 
dx dy dx dy 


(4.36) 
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(y F '?L- vF 

\r dx y r 2 dx) \r dy r 2 dy) 

OyxF' _ yxF \ _ / xyF' _ xyF \ _ Q 


(4.38) 


Likewise for the x- and y-cornponents. 


4.4 Gravity 


4.4.1 Gravity due to a sphere 


The gravitational force on a point-mass m, located a distance r from a point-mass 
M, is given by Newton’s law of gravitation, 


, . -GMm 
F(r) = - 2 - 

r Z 


(4.39) 



Figure 4.7 


where the minus sign indicates an attractive force. What is the force if we replace 
the point mass M by a sphere of radius R and mass Ml The answer (assuming 
that the sphere is spherically symmetric, that is, the density is a function only of r) 
is that it is still —GMm/r 2 . A sphere acts just like a point mass at its center, for 
the purposes of gravity. This is an extremely pleasing result, to say the least. If it 
were not the case, then the universe would be a far more complicated place than it 
is. In particular, the motion of planets and such things would be much harder to 
describe. 

To prove this result, it turns out to be much easier to calculate the potential 
energy due to a sphere, and to then take the derivative to obtain the force, rather 
than to calculate the force explicitly. 6 So this is the route we will take. It will suffice 
to demonstrate the result for a thin spherical shell, because a sphere is the sum of 
many such shells. 

Our strategy for calculating the potential energy at a point P, due to a spherical 
shell, will be to slice the shell into rings as shown in Fig. 4.7. Let the radius of the 
shell be R. Let P be a distance r from the center of the shell, and let the ring make 
the angle 6 shown. 

The distance, l, from P to the ring is a function of R, r, and 9. It may be found 
as follows. In Fig. 4.8, segment AB has length Asm 9. and segment BP has length 
r — Rcos9. So the length t in triangle ABP is 



Figure 4.8 


I = \J (I? sin 0) 2 + (r — Rcos9) 2 = \/ R 2 + r 2 — 2 rR cos 9. (4.40) 

What we’ve done here is just prove the law of cosines. 

The area of a ring between 9 and 9 + d9 is its width (which is R d9 ) times its 
circumference (which is 27ri?sin#). Letting a = M/(4nR 2 ) be the mass density of 

6 The reason for this is that the potential energy is a scalar quantity (just a number), whereas 
the force is a vector. If we tried to calculate the force, we would have to worry about forces pointing 
in all sorts of directions. With the potential energy, we simply have to add up a bunch of numbers. 
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the shell, we see that the potential energy of a mass m at P due to a thin ring is 
— Gmcr(Rd6)(2irR sin 9)/l. This is true because the gravitational potential energy, 


V(r) = 


—Gmim2 

e 


(4.41) 


is a scalar quantity, so the contributions from the little mass pieces simply add. 
Every piece of the ring is the same distance from P, and this distance is all that 
matters; the direction from P is irrelevant (unlike it would be with the force). The 
total potential energy at P is therefore 


V(r) 


-i: 


2tt aGR 2 m sin 0 d6 
y/R 2 + r 2 — 2rRcos9 

+ r 2 _ 2rflcos0 I 


(4.42) 


Note that the sin# in the numerator is what made this integral nice and doable. We 
must now consider two cases. If r > R, then we have 


Tr , . 27 xaGRm f, G(4irR 2 a)m GMm tn . 

V(r) =---(^(r + R) — (r — R)J = - v —-—’ =--— , (4.43) 

which is the potential due to a point-mass M located at the center of the shell, as 
desired. If r < R, then we have 

2'KaGRm / \ G(4TrR 2 a)m GMm 

V( r ) = ---((r + R) — (R — r)J =- K r 1 = -— , (4.44) 

which is independent of r. 

Having found V(r), we can now find F(r) by simply taking the negative of the 
gradient of V. The gradient is just ? (d/dr) here, because V is a function only of r. 
Therefore, 


F(r) 

F(r) 


GMm , f 
r 2 ’ 

0, if r < R. 


> R, 


(4.45) 


These forces are directed radially, of course. A sphere is the sum of many spherical 
shells, so if P is outside a given sphere, then the force at P is —GMm/r 2 , where 
M is the total mass of the sphere. This result will still hold even if the shells have 
different mass densities (but each one must have uniform density). 


Newton looked at the data, numerical, 

And then observations, empirical. 

He said, “But, of course, 

We get the same force 

From a point mass and something that’s spherical!” 
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M m t 

9 R V 

Figure 4.10 


If P is inside a given sphere, then the only relevant material is the mass inside 
a concentric sphere through P, because all the shells outside this region give zero 
force, from the second equation in eq. (4.45). The material “outside” of P is, for 
the purposes of gravity, not there. 

It is not obvious that the force inside a spherical shell is zero. Consider the point 
P in Fig. 4.9. A piece of mass, dm, on the right side of the shell gives a larger force 
on P than a piece of mass, dm, on the left side, due to the 1/r 2 dependence. But 
from the figure, there is more mass on the left side than the right side. These two 
effects happen to exactly cancel, as you can show in Problem 9. 

Note that the gravitational force between two spheres is the same as if they were 
replaced by two point-masses. This follows from two applications of our “point- 
mass” result. 


4.4.2 Tides 

The tides on the earth exist because the gravitational force from a point mass (or a 
spherical object, in particular the moon or the sun) is not uniform. The direction of 
the force is not constant (the force lines converge to the source), and the magnitude 
is not constant (it falls off like 1/r 2 ). On the earth, these effects cause the oceans 
to bulge around the earth, producing the observed tides. 

The study of tides is useful in part because tides are a very real phenomenon 
in the world, and in part because the following analysis gives us an excuse to make 
lots of approximations with Taylor series and such. Before considering the general 
case of tidal forces, let’s look at two special cases. 


Longitudinal tidal force 

In Fig. 4.10, two particles of mass m are located at points (I?, 0) and ( R+ m, 0), 
with x <C R. A planet of mass M is located at the origin. What is the difference 
between the gravitational forces acting on these two masses? 

The difference in the forces is (using x <C R to make suitable approximations) 


—GMm 
(R + x) 2 


-GMm _ -GMm 
R 2 ~ R? + 2 Rx 
GMm / 


R 2 


GMm _ GMm ( 
R 2 ~ R 2 V 
(1 - 2x/R) + l) = 


1 + 2 x/R + \ 

2 GMmx 

R? ' 


(4.46) 


This is, of course, simply the derivative of the force, times x. This difference points 
along the line joining the masses, and its effect is to pull the masses apart. 

We see that this force difference is linear in the separation x, and inversely 
proportional to the cube of the distance from the source. This force difference is 
the important quantity (as opposed to the force on each mass) when we are dealing 
with the relative motion of objects in free-fall around a given mass (for example, 
circular orbiting motion, or radial falling motion). This force difference is referred 
to as the “tidal force.” 

Consider two people, A and B, both of mass m, in radial free-fall toward a 
planet. Imagine that they are connected by a string, and enclosed in a windowless 
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box. Neither can feel the gravitational force acting on him (for all they know, 
they are floating freely in space). But they each feel a tension in the string equal 
to T = GMrnx /1? 3 (neglecting higher-order terms in x/R ), pulling in opposite 
directions. The difference in these tension forces is 2T, which exactly cancels the 
difference in the gravitational force, thereby allowing the separation to remain fixed. 

How do A and B view the situation? They will certainly feel the tension force. 
They will therefore conclude that there must be some other mysterious “tidal force” 
that opposes the tension, yielding a total net force of zero, as measured in their 
windowless box. 


Transverse tidal force 


In Fig. 4.11, two particles of mass m are located at points (I?, 0) and ( R,y ), with 
y <C R. A planet of mass M is located at the origin. What is the difference between 
the gravitational forces acting on these two masses? 

Both masses are the same distance R from the origin, up to second-order effects 
in y/R (using the Pythagorean theorem), so the magnitudes of the forces on them 
are essentially the same. The direction is the only thing that is different, to first 
order in y/R. The difference in the forces is the ^/-component of the force on the 
top mass. The magnitude of this component is 


GMm . „ GMm (y\ GMmy 


(4.47) 


This difference points along the line joining the masses, and its effect is to pull the 
masses together. As in the longitudinal case, the transverse tidal force is linear in 
the separation y, and inversely proportional to the cube of the distance from the 
source. 


General tidal force 


We will now calculate the tidal force at an arbitrary point on a circle of radius r 
centered at the origin (this circle represents the earth), due to a mass M located 
at the vector —R; see Fig. 4.12. We will calculate the tidal force relative to the 
origin. Note that the vector from M to a point P on the circle is R + r. And as 
usual, assume l r l« |R| - 

The attractive gravitational force may be written as F(x) = —GMmx./ |x| 3 , 
where x is the vector from M to the point in question. The cube is in the denom¬ 
inator because the vector in the numerator contains one power of the distance. In 
the present case we have x = R + r, so the desired difference between the force on a 
mass m at point P and the force on a mass m at the origin is the tidal force Ft(r) 
given by 


F t (r) _ —(R + r) -R 
GMm ~ |R + r| 3 |R| 3 ' 


(4.48) 


This is the exact expression for the tidal force. However, it is completely useless. 7 



R 

Figure 4.11 



Figure 4.12 


7 This reminds me of a joke about two people lost in a hot-air balloon. 
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Let us therefore make some approximations in eq. (4.48) and transform it into 
something technically incorrect (as approximations tend to be), but far more useful. 

The first thing we need to do is rewrite the |R + r| term. We have (using r <C R 
and ignoring higher-order terms) 


|R + r| = ^(R + r) • (R + r) 
= VR 2 + r 2 + 2R • r 
« Ry/l + 2R • r/R 2 


Therefore (again using r«B), 



F t( r ) 

R + r 

R 

+ R 3 

GMm 

R 3 (l + R • r/R 2 ) 3 


R + r 

R 


R 3 (l + 3R • r/R 2 ) 

+ R 3 


R + r / 3R • r\ R 

V ~ ) + R? 


(4.49) 


(4.50) 


Letting R = R/R, we finally have (once again using rCJ?) 

_ . . GMm{ 3R(R ■ r) - r) 

F,(r)S! - & -- 


(4.51) 



Figure 4.13 


This is the general expression for the tidal force. We can put it in a simpler form 
if we let M lie on the negative x-axis, which can arrange for with a rotation of the 
axes. We then have R = x, and so R • r = x. Eq. (4.51) then tells us that the tidal 
force at the point P = (x, y) equals 

F t(r) ~ r3 (3a^x - (xx + yy)j = r3 (2x, -y). (4.52) 

This reduces properly in the two special cases considered above. The tidal forces at 
various points on the circle are shown in Fig. 4.13. 

If the earth were a rigid body, then the tidal force would have no effect on it. 
But the water in the oceans is free to slosh around. The water on the earth bulges 
along the line from the earth to the moon, and also along the line from the earth 
to the sun. As the earth rotates beneath the bulge, a person on the earth sees the 
bulge rotate relative to the earth. From Fig. 4.13, we see that this produces two 
high tides and two low tides per day. It’s actually not exactly two per day, because 
the moon moves around the earth. But this motion is fairly slow, taking about a 
month, so it’s a reasonable approximation for the present purposes to think of the 
moon as motionless. 

Note that it is not the case that the moon pushes the water away on the far side 
of the earth. It pulls on that water, too; it just does so in a weaker manner than it 
pulls on the rigid part of the earth. Tides are a comparative effect. 
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Remarks: 

1. Consider two equal masses separated by a given distance on the earth. It turns out 
that the gravitational force from the sun on them is (much) larger than that from the 
moon, whereas the tidal force from the sun on them is (slightly) weaker than that 
from the moon. Quantitatively, the ratio of the gravitational forces is 


F s (gM s \ fi 

f GM n \ 

6 • 10 -3 m/s 2 

Fm \Rls) / \ 

\ M J 

_ 3.4 • 10 -5 m/s s 

And the ratio of the tidal forces is 



F,,s = (GMs \ / 

' (gm m 

\ 4-10- 14 s“ 2 

Ft,M y s J j 

\ F|,m , 

j " 9 • 10 -14 s -2 


(4.53) 


(4.54) 


2. Eq. (4.54) shows that the moon’s tidal effect is roughly twice the sun’s. This has an 
interesting implication about the densities of the moon and sun. Note that the tidal 
force from, say, the moon is proportional to 


(gm m \ 

\ ^!,m / 


/ G(| 7 rrg 1 )p M \ 

\ ^E,M / 


oc p M 


/ JgC 


~ Pm@m, 


(4.55) 


where 9m is half of the angular size of the moon in the sky. Likewise for the sun’s 
tidal force. But it just so happens that the angular sizes of the sun and the moon are 
essentially equal, as you can see by looking at them (preferably through some haze), 
or by noting that total solar eclipses barely exist. Therefore, the combination of eq. 
(4.54) and eq. (4.55) tells us that the moon’s density is about twice the sun’s. X 


4.5 Momentum 

4.5.1 Conservation of momentum 


Newton’s third law says that for every force there is an equal and opposite force. 
More precisely, if F a & is the force that particle a feels due to particle b, and if F i m is 
the force that particle b feels due to particle a, then Fb a = —F ab at all times. 

This law has important implications concerning momentum. Consider two par¬ 
ticles that interact over a period of time. Assume that they are isolated from outside 
forces. From Newton’s second law, 


F 


dp 

dt 


(4.56) 


we see that the total change in a particle’s momentum equals the time integral of 
the force acting on it. That is, 


fh 

p(t 2 ) - p(h) = / F dt. 

■hi 


(4.57) 


This integral is called the impulse. If we now invoke the third law, F }, a = —F a &, we 
find 


Pafti) - Po(ii) = [ F abdt 

Jti 

= - j F ba dt = ~(pb(h) ~ P&Oi))- (4.58) 
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Therefore, 


Pa(*2) + P b(h) = Pa(*l) + Pb(h)- 


(4.59) 


In other words, the total momentum of this isolated system is conserved. It does 
not depend on time. Note that eq. (4.59) is a vector equation, so it is really three 
equations, namely conservation of p x , p y , and p z . 


Example (Splitting mass): A mass M moves with speed V in the ^-direction. It 
explodes into two pieces that go off at angles 9\ and 9%, as shown in Fig. 4.14 What 
are the magnitudes of the momenta of the two pieces? 

Solution: Let P = MV be the initial momentum, and let p-\ and p 2 be the final 
momenta. Conservation of momentum in the x- and y-directions gives, respectively, 


Pi cos 9\ + P 2 cos 02 = P, 

Pi sin0i — p 2 sin 0 2 = 0. (4.60) 


Solving for pi and p 2 , and using a trig addition formula, gives 
P sin 0 2 P sin 0i 


sin(0i + 0 2 ) 


sin(0i + 0 2 ) 


(4.61) 


Let’s check a few limits. If 0i = 0 2 , then pi = p 2 , as it should. If, in addition, 0i 
and 02 are both small, then pi = p 2 « P/2, as they should. If, on the other hand, 
9i = 0 2 « 90°, then pi and p 2 are both very large; the explosion must have provided 
a large amount of energy. 

Note that with the given information, we can’t determine what the masses of the two 
pieces are. To find these, we would need to know two more pieces on information, such 
as how much energy the explosion gave to the system, and what one of the masses or 
speeds is. Then we would have an equal number of equations and unknowns. 


Remark: Newton’s third law makes a statement about forces. But force is defined 
in terms of momentum via F = dp/dt. So the third law essentially postulates conservation 
of momentum. (The “proof” above in eq. (4.58) is hardly a proof. It involves one simple 
integration.) So you might wonder if momentum conservation is something you can prove, 
or if it’s something you have to assume (as we have basically done). 

The difference between a postulate and a theorem is rather nebulous. One person’s 
postulate might be another person’s theorem, and vice-versa. You have to start somewhere 
in your assumptions. We chose to start with the third law. In the Lagrangian formalism 
in Chapter 5, the starting point is different, and momentum conservation is deduced as a 
consequence of translational invariance (as we will see). So it looks more like a theorem in 
that formalism. 

But one thing is certain. Momentum conservation of two particles cannot be proven from 
scratch for arbitrary forces, because it is not necessarily true. For example, if two charged 
particles interact in a certain way through the magnetic fields they produce, then the total 
momentum of the two particles might not be conserved. Where is the missing momentum? 
It is carried off in the electromagnetic field. The total momentum of the system is indeed 
conserved, but the fact of the matter is that the system consists of the two particles plus 
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the electromagnetic field. Said in another way, each particle actually interacts with the 
electromagnetic field, and not the other particle. Newton’s third law does not necessarily 
hold for particles subject to such a force. & 

Let’s now look at momentum conservation for a system of many particles. As 
above, let E tJ be the force that particle i feels due to particle j. Then F ^ = -F J( ; 
at all times. Assume the particles are isolated from outside forces. 

The change in the momentum of the ith particle from t\ to t '2 is (we won’t bother 
writing all the t's in the expressions below) 


Api 



dt. 


(4.62) 


Therefore, the change in the total momentum of all the particles is 


But Yli JCj F ij = 0 at all times, because for every term F a ;,, there is a term F; )a , and 
F a b + F;, a = 0. (And also, F aa = 0.) Therefore, the total momentum of an isolated 
system of particles is conserved. 


4.5.2 Rocket motion 


The application of momentum conservation becomes a little more exciting when the 
mass to is allowed to vary. Such is the case with rockets, because most of their mass 
consists of fuel which is eventually ejected. 

Let mass be ejected with speed u relative to the rocket, 8 at a rate dm/dt. We’ll 
define the quantity dm to be negative, so during a time dt the mass dm gets added 
to the rocket’s mass. (If you wanted, you could define dm to be positive, and then 
subtract it from the rocket’s mass. Either way is fine.) Also, we’ll define u to be 
positive, so the ejected particles lose a speed u relative to the rocket. It may sound 
silly, but the hardest thing about rocket motion is picking a sign for these quantities 
and sticking with it. 

Consider a moment when the rocket has mass to and speed v. Then at a time dt 
later (see Fig. 4.15), the rocket has mass m+dm and speed v+dv, while the exhaust 
has mass (—dm) and speed v — u (which may be positive or negative, depending on 
the relative size of v and u). There are no external forces, so the total momentum 
at each of these times must be equal. Therefore, 



mv = (to + dm)(v + dv) + (— dm)(v — u). (4.64) Figure 4.15 


8 Just to emphasize, u is the speed with respect to the rocket. It wouldn’t make much sense to 
say “relative to the ground,” because the rocket’s engine spits out the matter relative to itself, and 
the engine has no way of knowing how fast the rocket is moving with respect to the ground. 
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Ignoring the second-order term yields mdv = —u dm. Dividing by m and integrating 
from t\ to t '2 gives 


/: 


*-r 


m — vi = u\n — . (4.65) 

m 2 


For the case where the initial mass is M and the initial speed is 0, we have v = 
uln(M/m). And if dm/dt happens to be constant (call it —rj, where r) is positive), 
then v(t) = uln[M/(M — rjt )]. 

The log in the result in eq. (4.65) is not very encouraging. If the mass of the 
metal in the rocket is m, and if the mass of the fuel is 9m, then the final speed is 
only nln 10 « (2.3)u. If the mass of the fuel is increased by a factor of 11 up to 99m 
(which is probably not even structurally possible, given the amount of metal required 
to hold it), then the final speed only doubles to nln 100 = 2(u In 10) ~ (4.6)u. How 
do you make a rocket go significantly faster? Exercise 33 deals with this question. 

Remark: If you want, you can solve this rocket problem by using force instead of 
conservation of momentum. If a chunk of mass (—dm) is ejected out the back, then its 
momentum changes by udm (which is negative). Since force equals the rate of change of 
momentum, the force on this chunk is u dm/dt. By Newton’s third law, the remaining 
part of the rocket feels a force of —udm/dt (which is positive). This force accelerates the 
remaining part of the rocket, so F = ma gives —u dm/dt = mdv/dt , 9 which is equivalent 
to the mdv = —udm result above. 

We see that this rocket problem can be solved by using either force or conservation 
of momentum. In the end, these two strategies are really the same, because the latter 
was derived from F = dp/dt. But the philosophies behind the approaches are somewhat 
different. The choice of strategy depends on personal preference. In an isolated system such 
as a rocket, conservation of momentum is usually simpler. But in a problem involving an 
external force, F = dp/dt is the way to go. You’ll get lots of practice with F = dp/dt in the 
problems for this section and also in Section 4.8. 

Note that we used both F = dp/dt and F = ma in this second solution to the rocket 
problem. These are not equal if the mass of a particle changes. For further discussion on 
which expression to use in a given situation, see Appendix E. A 



S 


4.6 The CM frame 

4.6.1 Definition 

When talking about momentum, it is understood that a certain frame of reference 
has been picked. After all, the velocities of the particles have to be measured with 
respect to some coordinate system. Any inertial (that is, non-accelerating) frame is 
as good as any other, but we will see that there is one particular reference frame 
that is often advantageous to use. 

Consider a frame S and another frame S' that moves at constant velocity u with 
respect to S (see Fig. 4.16). Given a system of particles, the velocity of the ith 

9 Whether we use m or m + dm here for the mass 
are of second order. 


Figure 4.16 


of the rocket doesn’t matter. Any differences 
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particle in S is related to its velocity in S' by 


(4.66) 


It is then easy to see that if momentum is conserved during a collision in frame S', 
then it is also conserved in frame S. This is true because both the initial and final 
momenta of the system in S are increased by the same amount (E m «) u > compared 
to what they are in .S'. 10 

Let us therefore consider the unique frame in which the total momentum of a 
system of particles is zero. This is called the center of mass frame, or CM frame. If 
the total momentum is P = E my l in frame S, then the CM frame S' is the frame 
that moves with velocity 


P _ E"h v i 
M = M 


(4.67) 


with respect to S, where M = E is the total mass. This is true because we can 
use eq. (4.66) to write 


P' = ^m;v' 



= P-P = 0. (4.68) 


The CM frame is extremely useful. Physical processes are generally much more 
symmetrical in this frame, and this makes the results more transparent. 

The CM frame is also sometimes called the “zero-momentum” frame. But the 
“center of mass” name is commonly used because the center of mass of the particles 
does not move in the CM frame, defined by the velocity in eq. (4.67). The position 
of the center of mass is given by 


Rcm = Ep ■ («9) 

This is the location of the pivot upon which a rigid system would balance, as we will 
see in Chapter 7. The fact that the CM doesn’t move in the CM frame follows from 
the fact that the derivative of Rcm is simply the velocity of the CM frame in eq. 

(4.67). The center of mass may therefore be chosen as the origin of the CM frame. 

Along with the CM frame, the other frame that people generally work with is 
the lab frame. There is nothing at all special about this frame. It is simply the 
frame (assumed to be inertial) in which the conditions of the problem are given. 
Any inertial frame can be called the “lab frame.” Solving problems often involves 
switching back and forth between the lab and CM frames. For example, if the final 
answer is requested in the lab frame, then you may want to transform the given 

^Alternatively, nowhere in our earlier derivation of momentum conservation did we say what 
frame we were using. We only assumed that the frame was not accelerating. If it were accelerating, 
then F would not equal ma. We will see in Chapter 9 how F = ma is modified in a non-inertial 
frame. But no need to worry about that here. 
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Figure 4.17 


information from the lab frame into the CM frame where things are more obvious, 
and then transform back to the lab frame to give the answer. 


Example (Two masses in 1-D): A mass m with speed v approaches a stationary 
mass M (see Fig. 4.17). The masses bounce off each other without any loss in total 
energy. What are the final velocities of the particles? Assume all motion takes place 
in 1-D. 


Solution: Doing this problem in the lab frame would require a potentially messy 
use of conservation of energy (see the example in Section 4.7.1). But if we work in 
the CM frame, things are much easier. 

The total momentum in the lab frame is mv, so the CM frame moves to the right 
with speed mv/(m + M) = u with respect to the lab frame. Therefore, in the CM 
frame, the velocities of the two masses are 


v m 


Mv 

TO + M ’ 


mv 

m + M ' 


(4.70) 


As a double-check, the difference in the velocities is v, and the ratio of the speeds is 
M/m. 

The important point to realize now is that in the CM frame, the two particles must 
simply reverse their velocities after the collision (provided that they do indeed hit 
each other). This is true because the speeds must still be in the ratio M/m after the 
collision, in order for the total momentum to remain zero. Therefore, the speeds must 
either both increase or both decrease. But if they do either of these, then energy is 
not conserved. 11 


If we now go back to the lab frame by adding the CM velocity of mv/(m + M) to the 
two new velocities of —Mv/(m + M) and mv/(m + M), we obtain final lab velocities 
of 


(m — M)v 
m+M 


and 


(4.71) 


Remark: If m = M, then we see that the left mass stops, and the right mass picks up a 
speed of v. If M > m, then the left mass bounces back with speed « v, and the right mass 
hardly moves. If M, then the left mass keeps plowing along at speed « v, and the right 
mass picks up a speed of « 2v. This 2v is an interesting result (it is clearer if you consider 
things in the frame of the heavy mass to, which is essentially the CM frame), and it leads 
to some neat effects, as in Problem 22. 


4.6.2 Kinetic energy 

Given a system of particles, the relationship between the total kinetic energy in 
two different frames is generally rather messy and unenlightening. But if one of the 
frames is the CM frame, then the relationship turns out to be quite nice. 

11 So we did have to use conservation of energy in this CM-frame solution. But it was far less 
messy than it would have been in the lab frame. 
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Let S' be the CM frame, which moves at constant velocity u with respect to 
another frame S. Then the velocities of the particles in the two frames are related 
by 

Vi = v- + u. (4.72) 

The kinetic energy in the CM frame is 

KE CM = ^m,|v'| 2 . (4.73) 

And the kinetic energy in frame S is 

KE s = ^E^K + ul 2 

= ^ ^2 m;(v' • v- + 2v- • u + u • u) 

= ^E m *Kl 2 + u - (E m * v 0 + ^l u l 2 E m * 

= KE cm + ijhti 2 , (4.74) 

where M is the total mass of the system, and where we have used m i v i = 0, by 
definition of the CM frame. Therefore, the KE in any frame equals the KE in the 
CM frame, plus the kinetic energy of the whole system treated like a point mass M 
located at the CM (which moves with velocity u). An immediate corollary of this 
fact is that if the KE is conserved in a collision in one frame, then it is conserved in 
any other frame. 

4.7 Collisions 

There are two basic types of collisions among particles, namely elastic ones (in which 
kinetic energy is conserved), and inelastic ones (in which kinetic energy is lost). In 
any collision, the total energy is conserved, but in inelastic collisions some of this 
energy goes into the form of heat (that is, relative motion of the atoms inside the 
particles) instead of showing up in the net translational motion of the particle. 

We’ll deal mainly with elastic collisions here, although some situations are in¬ 
herently inelastic, as we’ll discuss in Section 4.8. For inelastic collisions where it is 
stated that a certain fraction, say 20%, of the kinetic energy is lost, only a trivial 
modification of the following procedure is required. 

To solve any elastic collision problem, we simply have to write down the conser¬ 
vation of energy and momentum equations, and then solve for whatever variables 
we want to find. 

4.7.1 1-D motion 

Let’s first look at one-dimensional motion. To see the general procedure, we’ll solve 
the example from Section 4.6.1 again. 
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Figure 4.18 


Example (Two masses in 1-D, again): A mass to with speed v approaches a 
stationary mass M (see Fig. 4.18). The masses bounce off each other elastically. 
What are the final velocities of the particles? Assume all motion takes place in 1-D. 

Solution: Let v' and V' be the final velocities of the masses. 12 Then conservation 
of momentum and energy give, respectively, 

mv + 0 = mv' + MV', 

^mv 2 + 0 = *rm/ 2 +*MW 2 . (4.75) 

We must solve these two equations for the two unknowns v' and V'. Solving for V 
in the first equation and substituting into the second gives 



0 = (to + M)v' 2 — 2 mvv' + (to — M)v 2 , 

0 = ({m + M)v' — (to — M)vj (v 1 — v). (4-76) 


One solution is v' = v, but this is not the one we are concerned with. It is of course 
a solution, because the initial conditions certainly satisfy conservation of energy and 
momentum with the initial conditions (a fine tautology indeed). If you want, you can 
view v' = v as the solution where the particles miss each other. The fact that v' = v 
is always a root can often save you a lot of quadratic-formula trouble. 

The v' = v(m — M )/(to + M) root is the one we want. Plugging this v' back into the 
first of eqs. (4.75) to obtain V' gives 


(to — M)v 
m + M 


(4.77) 


in agreement with eq. (4.71). 


This solution was somewhat of a pain, because it involved a quadratic equation. 
The following theorem is extremely useful because it offers a way to avoid the hassle 
of quadratic equations when dealing with 1-D elastic collisions. 

Theorem 4.3 In a 1-D elastic collision, the relative velocity of two particles after 
a collision is the negative of the relative velocity before the collision. 

Proof: Let the masses be m and M. Let v, and V) be the initial velocities, and let 
Vf and Vf be the final velocities. Conservation of momentum and energy give 

mvi + MVi = mvf + MVf 

^mvf + ^MV? = ^mvj+^MVf. (4.78) 

12 In Section 4.6, a primed denoted a reference frame, but we’re now using a prime to denote 
“final.” 
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Rearranging these yields 


m(vi - v f ) = M(V f -Vi ). 

rn(vf - vj) = M(Vf — V?) (4.79) 

Dividing the second equation by the first gives Vi + Vf = Vi + Vf. Therefore, 

Vi - Vi - ~(v f - V f ), (4.80) 

as we wanted to show. Note that in taking the quotient of these two equations, we 
have lost the Vf = Vi and Vf = Vi solution. But as stated in the above example, 
this is the trivial solution. ■ 

This is a splendid theorem. It has the quadratic energy-conservation statement 
built into it. Hence, using this theorem along with momentum conservation (both of 
which are linear statements) gives the same information as the standard combination 
of eqs. (4.78). 

Note that the theorem is quite obvious in the CM frame (as we argued in the 
example in Section 4.6.1). Therefore, it is true in any frame, because it involves 
only differences in velocities. 

4.7.2 2-D motion 

Let’s now look at the more general case of two-dimensional motion. 3-D motion is 
just more of the same, so we’ll confine ourselves to 2-D. Everything is basically the 
same as in 1-D, except that there is one more momentum equation, and one more 
variable to solve for. This is best seen through an example. 


Example (Billiards): A billiard ball with speed v approaches an identical station¬ 
ary one. The balls bounce off each other elastically, in such a way that the incoming 
one gets deflected by an angle 6 (see Fig. 4.19). What are the final speeds of the 
balls? What is the angle, <j>, at which the stationary ball is ejected? 

Solution: Let v' and V' be the final speeds of the balls. Then conservation of p x , 
p y , and E give, respectively, 

mv = mv' cos 6 + mV' cos </>, 
mv' sin 6 = mV' sin <f>, 

^rnv 2 = jrati' 2 + ^mV' 2 . (4-81) 

We must solve these three equations for the three unknowns v', V', and <t>. There are 
various ways to do this. Here is one. Eliminate <f> by adding the squares of the first 
two equations (after putting the mv' cos 6 on the left-hand side) to obtain 



Figure 4.19 


— 2vv' cos 9 + v' 2 = V' 2 . 


(4.82) 
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Now eliminate V' by combining this with the third equation to obtain 13 

v' = vcos 6. (4.83) 


The third equation then implies 


V' = v sin#. (4.84) 

The second equation then gives m(v cos 6) sin 6 = rn(v sin 0) sin </>, which implies cos 0 = 
sin (j), or 

0 = 90°- 9. (4.85) 

In other words, the balls bounce off at right angles with respect to each other. This fact 
is well known to pool players. Problem 18 gives another (cleaner) way to demonstrate 
this result. 


As we saw in the 1-D example in Section 4.6.1, collisions are often much easier 
to deal with in the CM frame. Using the same reasoning (conservation of p and 
E ) as in that example, we conclude that in 2-D (or 3-D), the final speeds of two 
elastically colliding particles must be the same as the initial speeds. The only degree 
of freedom is the angle of the line containing the final (oppositely directed) velocities. 
This simplicity in the CM frame invariably provides for a cleaner solution than the 
lab frame would yield. A good example of this is Exercise 43, which gives yet another 
way to derive the above right-angle billiard result. 


4.8 Inherently inelastic processes 

There is a nice class of problems where the system has inherently inelastic properties, 
even if it doesn’t appear so at first glance. In such a problem, no matter how you 
try to set it up, there will be inevitable kinetic energy loss that shows up in the form 
of heat. Total energy is conserved, of course; heat is simply another form of energy. 
But the point is that if you try to write down a bunch of (l/2)mr 2 ’s and conserve 
their sum, then you’re going to get the wrong answer. The following example is the 
classic illustration of this type of problem. 


Example (Sand on conveyor belt): Sand drops vertically at a rate a kg/s onto 
a moving conveyor belt. 

(a) What force must you apply to the belt in order to keep it moving at a constant 
speed v? 

(b) How much kinetic energy does the sand gain per unit time? 

(c) How much work do you do per unit time? 

(d) How much energy is lost to heat per unit time? 

13 Another solution is v = 0. In this case, <j> must equal zero, and 9 is not well-defined. We simply 
have the 1-D motion of the example in Section 4.6.1. 
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Solution: 


(a) Your force equals the rate of change of momentum. If we let m be the combined 
mass of the conveyor belt plus the sand on the belt, then 


dp _ d(mv) _ ^dv + dm ^ ^ 

dt dt dt dt 


(4.86) 


where we have used the fact that v is constant, 

(b) The kinetic energy gained per unit time is 


d / mv 2 \ _ dm / v 2 \ _ av 2 

Jt ) = It VTy) = AT 


(4.87) 


(c) The work done by your force per unit time is 

d ^= F ^ = Fv = av 2 , (4.88) 

dt dt 

where we have used eq. (4.86). 

(d) If work is done at a rate av 2 , and kinetic energy is gained at a rate av 2 /2, then 
the “missing” energy must be lost to heat at a rate av 2 — av 2 /2 = av 2 / 2. 


In this example, it turned out that exactly the same amount of energy was lost 
to heat as was converted into kinetic energy of the sand. There is an interesting 
and simple way to see why this is true. In the following explanation, we’ll just deal 
with one particle of mass M that falls onto the conveyor belt, for simplicity. 

In the lab frame, the mass simply gains a kinetic energy of Mv 2 / 2 by the time 
it finally comes to rest with respect to the belt, because the belt moves at speed v. 

Now look at things in the conveyor belt’s reference frame. In this frame, the 
mass comes flying in with an initial kinetic energy of Mv 2 / 2, and then it eventually 
slows down and comes to rest on the belt. Therefore, all of the Mv 2 / 2 energy is 
converted to heat. And since the heat is the same in both frames, this is the amount 
of heat in the lab frame, too. 

We therefore see that in the lab frame, the equality of the heat loss and the gain 
in kinetic energy is a consequence of the obvious fact that the belt moves at the 
same rate with respect to the lab (namely v) as the lab moves with respect to the 
belt (also v). 

In the solution to the above example, we did not assume anything about the 
nature of the friction force between the belt and the sand. The loss of energy to 
heat is an unavoidable result. You might think that if the sand comes to rest on the 
belt very “gently” (over a long period of time), then you can avoid the heat loss. This 
is not the case. In that scenario, the smallness of the friction force is compensated 
by the fact that the force must act over a very large distance. Likewise, if the sand 
comes to rest on the belt very abruptly, then the largeness of the friction force is 
compensated by the smallness of the distance over which it acts. No matter how 
you set things up, the work done by the friction force is the same nonzero quantity. 
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In other problems such as the following one, it is fairly clear that the process is 
inelastic. But the challenge is to correctly use F = dp/dt instead of F = ma, which 
will get you into trouble because the mass is changing. 


Example (Chain on a scale): A chain of length L and mass density a kg/m is 
held such that it hangs vertically just above a scale. It is then released. What is the 
reading on the scale, as a function of the height of the top of the chain? 

First solution: Let y be the height of the top of the chain, and let F be the desired 
force applied by the scale. The net force on the whole chain is F — ( aL)g (with 
upward taken to be positive). The momentum of the chain is ( ay)y. Note that this is 
negative, because y is negative. Equating the net force with the change in momentum 
gives 


F — <jLg = ^ 

= ayy + ay 2 . (4.89) 

The part of the chain that is still above the scale is in free fall. Therefore, y = —g. 
And y = — y), which is the usual result for a falling object. Putting these into 

eq. (4.89) gives 


F = aLg - ayg + 2a(L - y)g 

= 3 a(L - y)g. (4.90) 

This answer has the expected property of equaling zero when y = L, and also the 
interesting property of equaling 3 (aL)g right before the last bit touches the scale. 
Once the chain is completely on the scale, the reading will suddenly drop down to the 
weight of the chain, namely ( aL)g. 

Second solution: The normal force from the scale is responsible for doing two 
things. It holds up the part of the chain that already lies on the scale, and it also 
changes the momentum of the atoms that are suddenly brought to rest when they hit 
the scale. The first of these two parts of the force is simply the weight of the chain 
already on the scale, which is -F weig ht = cr(L — y)g. 

To find the second part of the force, we need to find the change in momentum, dp, of 
the part of the chain that hits the scale during a given time dt. The amount of mass 
that hits the scale in a time dt is dm = a\dy\ = a\y\ dt = —aydt. This mass initially 
has velocity y, and then it is abruptly brought to rest. Therefore, the change in its 
momentum is dp = 0 — ( dm)y = ay 2 dt. The force required to cause this change in 
momentum is thus 

F dp/dt = ^ = ay 2 . (4.91) 

But as in the first solution, we have y = y/2g(L — y). Therefore, the total force from 
the scale is 


^weight + T'dp/dt 

a(L - y)g + 2 a(L - y)g 
3a(L - y)g. 


(4.92) 
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Note that F dp / dt = 2F weight (until the chain is completely on the scale), independent 
of y. 


Many other problems of this sort are included in the exercises and problems for 
this chapter. 
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4.9 Exercises 

Section 4-1: Conservation of energy in 1-D 

1. Cart in a valley 

A cart containing sand starts at rest and then rolls, without any energy loss 
to friction, down into a valley and then up a hill on the other side. Let the 
initial height be hi, and let the final height attained on the other side be hi- 
If the cart leaks sand along the way, how does h ,2 compare to hi? 

2. Walking on a escalator 

An escalator moves downward at constant speed. You walk up the escalator 
at this same speed, so that you remain at rest with respect to the ground. Are 
you doing any work? 

3. Heading to infinity * 

A particle moves away from the origin under the influence of a potential V(x ) = 
—A|x| n . For what values of n will it reach infinity in a finite time? 

4. Work in different frames * 

An object, initially at rest, is subject to a force that causes it to undergo 
constant acceleration a for a time t. Verify explicitly that W = A K in (a) the 
lab frame, and (b) a frame moving to the left at speed V. 

5. Constant x ** 

A bead, under the influence of gravity, slides along a frictionless wire whose 
height is given by the function y(x). Assume that at position (x, y) = (0,0), 
the wire is horizontal and the bead passes this point with a given speed no 
to the right. What should the shape of the wire be (that is, what is y as 
a function of x ) so that the horizontal speed remains no at all times? One 
solution is simply y = 0. Find the other. 14 

6. Spring energy 

Using the explicit form of the position of a mass on the end of a spring, 
x(t) = Acos(u;t + </>), verify that the total energy is conserved. 

7. Hanging spring * 

A massless spring with spring-constant k hangs vertically from a ceiling, ini¬ 
tially at its relaxed length. A mass m is then attached to the bottom and is 
released. 

(a) Calculate the total potential energy of the system, as a function of the 
height y (which is negative), relative to the initial position. Make a rough 
plot of V{y). 

14 Solve this exercise in the spirit of Problem 6, that is, by solving a differential equation. Once 
you get the answer, you’ll see that you could have just written it down without any calculations, 
based on your knowledge of a certain kind of physical motion. 
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(b) Find yo, the point at which the potential energy is minimum. 

(c) Rewrite the potential energy as a function of z = y — yo- Explain why 
your result shows that a hanging spring can be considered to be a spring 
in a world without gravity, provided that the new equilibrium point, yo, 
is taken to be the “relaxed” length of the spring. 

8. Removing the friction ** 

A block of mass m is supported by a spring on an inclined plane as shown in 
Fig. 4.20. The spring constant is k, the plane’s angle of inclination is 6, and 
the coefficient of friction between the block and the plane is /j. 

(a) You move the block down the plane, compressing the spring. What is the 
maximum compression length of the spring (relative to the relaxed length 
it has when nothing is attached to it) that allows the block to remain at 
rest when you let go of it? 

(b) Assume that the block is at the maximum compression you found in 
part (a). At a given instant, you somehow cause the plane to become 
frictionless, and the block springs up along the plane. What must the 
relation between 6 and the original n be, so that the block reaches its 
maximum height when the spring is at its relaxed length? 



Figure 4.20 


9. Spring and friction ** 

A spring with spring-constant k stands vertically, and a mass m is placed on 
top of it. The mass is gradually lowered to its equilibrium position. With the 
spring held at this compression length, the system is rotated to a horizontal 
position. The left end of the spring is attached to a wall, and the mass is 
placed on a table with coefficient of kinetic friction fi = 1/8; see Fig. 4.21. 
The mass is released. 

(a) What is the initial compression of the spring? 

(b) How much does the maximal stretch (or compression) of the spring de¬ 
crease after each half-oscillation? Hint: I wouldn’t try to solve this by 
using F = ma. 

(c) How many times does the mass oscillate back and forth before coming to 
rest? 

10. Over the pipe ** 

A frictionless cylindrical pipe with radius r is positioned with its axis parallel 
to the ground, at height h. What is the minimum initial speed at which a 
ball must be thrown (from ground level) in order to make it over the pipe? 
Consider two cases: (a) the ball is allowed to touch the pipe, and (b) the ball 
is not allowed to touch the pipe. 
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Figure 4.24 





Figure 4.25 


11. Pendulum projectile * 

A pendulum is held with its string horizontal and is then released. The mass 
swings down, and then on its way back up, the string is cut when it makes an 
angle of 0 with the vertical; see Fig. 4.22. What should 6 be, so that the mass 
travels the largest horizontal distance by the time it returns to the height it 
had when the string was cut? 

12. Bead on a hoop ** 

A bead is initially at rest at the top of a fixed frictionless hoop of radius R , 
which lies in a vertical plane. The bead is given a tiny kick so that it slides 
down and around the hoop. At what points on the hoop does the bead exert 
a maximum horizontal force on the hoop? 

13. Beads on a hoop ** 

Two beads of mass m are initially at rest at the top of a frictionless hoop of 
mass M and radius R, which stands vertically on the ground. The beads are 
given tiny kicks, and they slide down the hoop, one to the right and one to 
the left, as shown in Fig. 4.23. What is the largest value of m/M for which 
the hoop will never rise up off the ground? 

14. Stationary bowl *** 

A hemispherical bowl of mass M rests on a table. The inside surface of the 
bowl in frictionless, while the coefficient of friction between the bottom of the 
bowl and the table is fi = 1. A particle of mass m is released from rest at the 
top of the bowl and slides down into it, as shown in Fig. 4.24. What is the 
largest value of m/M for which the bowl will never slide on the table? Hint: 
The angle you will be concerned with is not 45°. 

15. Roller coaster * 

A roller coaster car starts at rest and coasts down a frictionless track. It 
encounters a vertical loop of radius R. How much higher than the top of the 
loop must the car start if it to remain in contact with the track at all times? 

16. Pendulum and peg * 

A pendulum of length L is initially held horizontal, and is then released. The 
string runs into a peg a distance d below the pivot, as shown in Fig. 4.25. 
What is the smallest value of d for which the string remains taught at all 
times? 

17. Unwinding string ** 

A mass is connected to one end of a massless string, the other end of which 
is connected to a very thin frictionless vertical pole. The string is initially 
wound completely around the pole, in a very large number of little horizontal 
circles, with the mass touching the pole. The mass is released, and the string 
gradually unwinds. What angle does the string make with the pole at the 
moment it becomes completely unwound? 
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18. Leaving the hemisphere **** 

A point particle of mass to sits at rest on top of a frictionless hemisphere of 
mass M, which rests on a frictionless table. The particle is given a tiny kick 
and slides down the hemisphere. At what angle 9 (measured from the top of 
the hemisphere) does the particle lose contact with the hemisphere? 

In answering this question for m ^ M, it is sufficient for you to produce an 
equation that 6 must satisfy (it will be a cubic). However, for the special case 
of to = M, this equation can be solved without too much difficulty; find the 
angle in this case. 

Section 4-4 : Gravity 

19. Projectile between planets * 

Two planets of mass M and radius R are at rest with respect to each other, 
with their centers a distance 4 R apart. You wish to fire a projectile from the 
surface of one planet to the other. What is the minimum initial speed for 
which this is possible? 

20. Spinning quickly * 

Consider a planet with uniform mass density p. If the planet rotates too fast, 
it will fly apart. Show that the minimum period of rotation is given by 



What is the minimum T if p = 5.5g/cm 3 (the average density of the earth)? 

21. Supporting a tube * 

Imagine the following unrealistic undertaking. Drill a narrow tube (with cross 
sectional area A) from the surface of the earth down to the center. Then line 
the cylindrical wall of the tube with a frictionless coating. Then fill the tube 
back up with the dirt (and magma, etc.) you originally removed. What force 
is necessary at the bottom of the tube of dirt (that is, at the center of the 
earth) to hold it up? Let the earth’s radius be R, and assume a uniform mass 
density p. 

22. Force from a straight wire ** 

A particle of mass to is placed a distance l away from an infinitely long straight 
wire with mass density pkg/rri. Show that the force on the particle is F = 
IGfym/t 

23. Speedy travel ** 

A straight tube is drilled between two points on the earth, as shown in 
Fig. 4.26. An object is dropped into the tube. What is the resulting mo¬ 
tion? How long does it take to reach the other end? Ignore friction, and 
assume (erroneously) that the density of the earth is constant. 
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24. Ratio of potentials * 

Consider the following two systems: (1) a mass m is placed at the corner of a 
flat square sheet of mass M, and (2) a mass m is placed at the center of a flat 
square sheet of mass M. What is the ratio of the potential energies of m in 
the two systems? Hint: Find A and B in the suggestive relations in Fig. 4.27. 

25. Relative speed * 

Two particles with masses m and M are initially at rest, a very large (essen¬ 
tially infinite) distance apart. They are attracted to each other due to gravity. 
What is their relative speed when they are a distance r apart? 

26. Orbiting stick ** 

Consider a planet of mass M and radius R. A very long stick of length 2 R 
extends from just above the surface of the planet, to a radius 3 R. If initial 
conditions have been set up so that the stick moves in a circular orbit while 
always pointing radially (see Fig. 4.28), what is the period of this orbit? How 
does this period compare to the period of a satellite in a circular orbit of radius 
2 R1 


27. Geosynchronous orbits ** 


(a) Let the earth’s radius be R, its average density be p, and its angular 
frequency of rotation be u. Show that if a satellite is to remain above 
the same point on the equator at all times, then it must travel in a circle 
of radius r]R, where 

AirGp 


V 


3w 2 


(4.93) 


What is the numerical value for rfl 


(b) Instead of a satellite, consider a long rope with uniform mass density 
extending radially from the surface of the earth out to a radius p'R . 15 
Show that if the rope is to remain above the same point on the equator 
at all times, then rf must be given by 


8irGp 
~~ 3w 2 ' 


(4.94) 


What is the numerical value for r/? Where is the tension in the rope 
maximum? Hint: No messy calculations required. 


28. Spherical shell ** 


(a) A spherical shell of mass M has inner radius Ri and outer radius R-z- A 
particle of mass m is located a distance r from the center of the shell. 
Calculate (and make a rough plot of) the force on to, as a function of r, 
for 0 < r < oo. 

15 Any proposed space elevator wouldn’t have uniform mass density. But this simplifies problem 
still gives a good idea of the general features. 
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(b) If the mass m is dropped from r = oo and falls down through the shell 
(assume that a tiny hole has been drilled in it), what will m’s speed be at 
the center of the shell? You can let R 2 = 2R\ in this part of the problem, 
to keep things from getting too messy. Give your answer in terms of 
R = R\. 


29. Roche limit * 


A small spherical rock covered with sand falls in radially toward a planet. 
Let the planet have radius R and density p p , and let the rock have density 
p r . It turns out that when the rock gets close enough to the planet, the tidal 
force ripping the sand off the rock will be larger than the gravitational force 
attracting the sand to the rock. The cutoff distance is called the Roche limit. 
Show that it is given by 16 


d = R 



(4.95) 


30. Maximal gravity *** 

Given a point P in space, and given a piece of malleable material of constant 
density, how should you shape and place the material in order to create the 
largest possible gravitational field at PI 

Section 4-5: Momentum 

31. Sticking masses 

A mass 3m moving east at speed v collides with a mass 2m moving northeast 
at speed 2v. The masses stick together. What is the resulting speed and 
direction of the combined mass? 

32. Snow on a sled * 

A sled on which you are riding is given an initial push and slides across friction¬ 
less ice. Snow is falling vertically (in the frame of the ice) on the sled. Assume 
that the sled travels in tracks that constrain it to move in a straight line. 
Which of the following three strategies causes the sled to move the fastest? 
The slowest? Explain your reasoning. 

(a) You sweep the snow off the sled so that it leaves the sled in the direction 
perpendicular to the sled’s tracks, as seen by you in the frame of the sled. 

(b) You sweep the snow off the sled so that it leaves the sled in the direction 
perpendicular to the sled’s tracks, as seen by someone in the frame of the 
ice. 

(c) You do nothing. 

16 For things orbiting circularly instead of falling radially inward, the cutoff distance is different, 
but only slightly. See the exercise in Chapter 9. The Roche limit gives the radial distance below 
which loose objects won’t collect into larger blobs. Our moon (which is a sphere of rock and sand) 
lies outside the earth’s Roche limit. But Saturn’s rings (which consists of loose ice particles) lie 
inside its limit. 
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33. Speedy rockets ** 

Assume that it is impossible to build a structurally sound container that can 
hold fuel of more than, say, nine times its mass. It would then seem like the 
limit for the speed of a rocket is it In 10. How can you build a rocket that goes 
faster than this? 

34. Maximum P and E of rocket * 

A rocket ejects its exhaust at a given speed u. What is the mass of the rocket 
(including unused fuel) when its momentum is maximum? What is the mass 
when its energy is maximum? 

35. Leaky bucket *** 

Consider the setup in Problem 16, but now let the sand leak at a rate dm/dt = 
—bM. In other words, the rate is constant with respect to time, not distance. 
We’ve factored out an M here, just to make the calculations a little nicer. 

(a) Find v(t) and x(t) for the times when the bucket contains a nonzero 
amount of sand. 

(b) What is the maximum value of the bucket’s kinetic energy, assuming it 
is achieved before it hits the wall? 

(c) What is the maximum value of the magnitude of the bucket’s momentum, 
assuming it is achieved before it hits the wall? 

(d) For what value of b does the bucket become empty right when it hits the 
wall? 

36. Throwing a brick *** 

A brick is thrown from ground level, at an angle 6 with respect to the (hor¬ 
izontal) ground. Assume that the long face of the brick remains parallel to 
the ground at all times, and that there is no deformation in the ground or 
the brick when the brick hits the ground. If the coefficient of friction between 
the brick and the ground is ft, what should 9 be so that the brick travels the 
maximum total horizontal distance before finally coming to rest? Hint: The 
brick slows down when it hits the ground. Think in terms of impulse. 

Section 4-7: Collisions 

37. A 1-D collision * 

Consider the following one-dimensional collision. A mass 2m moves to the 
right, and a mass m moves to the left, both with speed v. They collide 
elastically. Find their final lab-frame velocities. Solve this by: 


(a) Working in the lab frame. 

(b) Working in the CM frame. 
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38. Perpendicular vectors * 

A mass m, moving with speed v. collides elastically with a stationary mass 
2m. Let their resulting velocities be hj and V 2 , respectively. Show that V 2 
must be perpendicular to V 2 + 2v\. Hint See Problem 18. 


39. Maximum number of collisions ** 

N balls are constrained to move in one dimension. If you are allowed to 
pick the initial velocities, what is the maximum number of collisions you can 
arrange for the balls to have among themselves? Assume the collisions are 
elastic. 


40. Triangular room ** 

A ball is thrown against a wall of a very long triangular room which has vertex 
angle 9. The initial direction of the ball is parallel to the angle bisector (see 
Fig. 4.29). How many bounces does the ball make? Assume the walls are 
frictionless. 

41. Three pool balls * 

A pool ball with initial speed v is aimed right between two other pool balls, 
as shown in Fig. 4.30. If the two right balls leave the collision at 30° with 
respect to the initial line of motion, find the final speeds of all three balls. 

42. Equal angles ** 

(a) A mass 2m moving at speed Vo collides elastically with a stationary mass 
m. If the two masses scatter at equal angles with respect to the incident 
direction, what is this angle? 

(b) What is the largest number that the above “2” can be replaced with, if 
you want it to be possible for the masses to scatter at equal angles? 



Figure 4.29 


o 



Figure 4.30 


43. Right angle in billiards ** 

A billiard ball collides elastically with an identical stationary one. By looking 
at the collision in the CM frame, show that the angle between the resulting 
trajectories in the lab frame is 90°. 

44. Maximum v y ** 

A mass M moving in the positive indirection collides elastically with a sta¬ 
tionary mass m. The collision is not necessarily head-on, so the masses may 
come off at angles, as shown in Fig. 4.31. Let 6 be the angle of m’s resulting 
motion. What should 9 be so that m has the largest possible speed in the 
y-direction? Hint: Think about what the collision should look like in the CM 
frame. 



Figure 4.31 
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45. Maximum deflection *** 

A mass M collides with a stationary mass m. If M < m, then it is possible for 
M to bounce directly backwards. However, if M > m, then there is a maximal 
angle of deflection of M. Show that this maximal angle equals sin -1 (m/M). 
Hint: It is possible to do this problem by working in the lab frame, but you 
can save yourself a lot of time by considering what happens in the CM frame, 
and then shifting back to the lab frame. 

46. Balls in a semicircle **** 

N identical balls lie equally spaced in a semicircle on a frictionless horizontal 
table, as shown. The total mass of these balls is M. Another ball of mass m 
approaches the semicircle from the left, with the proper initial conditions so 
that it bounces (elastically) off all N balls and finally leaves the semicircle, 
heading directly to the left. See Fig. 4.32. 

(a) In the limit N —? oo (so the mass of each ball in the semicircle, M/N, 
goes to zero), find the minimum value of M/m that allows the incoming 
ball to come out heading directly to the left. Hint: You’ll need to do 
Exercise 45 first. 

(b) In the minimum M/m case found in part (a), show that the ratio of m’s 
final speed to initial speed equals e~ n . 


Total mass M • 



Figure 4.32 





Figure 4.33 



47. Midair collision ** 

A ball is held and then released. At the instant it is released, an identical 
ball, moving horizontally with speed v, collides elastically with it. What is the 
maximum horizontal distance the latter ball can travel by the time it returns 
to the height of the collision? 

48. Bouncing between rings ** 

Two fixed circular rings, in contact with each other, stand in a vertical plane. 
A ball bounces elastically back and forth between the rings (see Fig. 4.33). 
Assume that initial conditions have been set up so that the ball’s motion 
forever lies in one parabola. Let this parabola hit the rings at an angle 9 
from the horizontal. Show that if you want the magnitude of the change 
in the horizontal component of the ball’s momentum at each bounce to be 
maximum, then you should pick cos 9 = (>/5 — l)/2, which just happens to be 
the inverse of the golden ratio. 

49. Bouncing between surfaces ** 

Consider the following generalization of the previous exercise. A ball bounces 
back and forth between a surface defined by f(x) and its reflection across the 
y-axis (see Fig. 4.34). Assume that initial conditions have been set up so 
that the ball’s motion forever lies in one parabola, with the contact points 
located at Exq. For what function f(x) is the magnitude of the change in the 


Figure 4.34 
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horizontal component of the ball’s momentum at each bounce independent of 
x 0 ? 

50. Drag force on a sphere ** 

A sphere of mass M and radius R moves with speed V through a region of 
space that contains particles of mass m that are at rest. There are n of these 
particles per unit volume. Assume m <C M, and assume that the particles do 
not interact with each other. What is the drag force on the sphere? 

51. Block and bouncing ball **** 

A block with large mass M slides with speed Vo on a frictionless table toward 
a wall. It collides elastically with a ball with small mass m, which is initially 
at rest at a distance L from the wall. The ball slides towards the wall, bounces 
elastically, and then proceeds to bounce back and forth between the block and 
the wall. 

(a) How close does the block come to the wall? 

(b) How many times does the ball bounce off the block, by the time the block 
makes its closest approach to the wall? 

Assume that M m, and give your answers to leading order in m/M. 


Section 4-8: Inherently inelastic processes 

52. Slowing down, speeding up * 

A plate of mass M initially moves horizontally at speed v on a frictionless 
table. A mass m is dropped vertically onto it and soon comes to rest with 
respect to the plate. How much energy is required to bring the system back 
up to speed v? 

53. Falling rope ** 

A rope with mass M and length L is held in the position shown in Fig. 4.35, 
with one end attached to a support. Assume that only a negligible length of 
the rope starts out below the support. The rope is released. Find the force 
that the support applies to the rope, as a function of time. 

54. Pulling the rope back ** 

A rope of length L and mass density a kg/m lies outstretched on a frictionless 
horizontal table. You grab one end and pull it back along itself, in a parallel 
manner, as shown in Fig. 4.36. If your hand starts from rest and has constant 
acceleration a, what is your force right before the rope is straightened out? 

55. Pulling the rope ** 

A rope with mass density a kg/m lies in a heap at the edge of a table. One 
end of the rope initially sticks out an infinitesimal distance from the heap. 
You grab this end and accelerate it downward with acceleration a. Assume 
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that there is no friction of the rope with itself as it unravels. As a function of 
time, what force does your hand apply to the rope? Find the value of a that 
makes your force always equal to zero. 

56. Heap and block ** 

A rope of mass m and length L lies in a heap on the floor, with one end 
attached to a block of mass M. The block is given a sudden kick and instantly 
acquires a speed Vo- Let x be the distance traveled by the block. In terms of 
x, what is the tension in the rope, just to the right of the heap, that is, at the 
point P shown? See Fig. 4.37. There is no friction in this problem - none 
with the floor, and none in the rope with itself. 

57. Downhill dustpan *** 

A dustpan slides down a plane inclined at angle 9. Dust is uniformly dis¬ 
tributed on the plane, and the dustpan collects the dust in its path. After a 
long time, what is the acceleration of the dustpan? Assume there is no friction 
between the dustpan and plane. 

58. Touching the floor **** 

A rope with mass density a kg/m hangs from a spring with spring-constant k. 
In the equilibrium position, a length L is in the air, and the bottom part of 
the rope lies in a heap on the floor; see Fig. 4.38. The rope is raised by a very 
small distance, b, and then released. What is the amplitude of the oscillations, 
as a function of time? 

Assume that (1) L>&, (2) the rope is very thin, so that the size of the heap 
on the floor is very small compared to b, (3) the length of the rope in the initial 
heap is larger than b, so that some of the rope always remains in contact with 
the floor, and (4) there is no friction of the rope with itself inside the heap. 


Figure 4.38 
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4.10 Problems 


Section 4-1-' Conservation of energy in 1-D 

1. Minimum length * 

The shortest configuration of string joining three given points is the one shown 
at the top of Fig. 4.39, where all three angles are 120°. 17 Explain how you 
could experimentally prove this fact by cutting three holes in a table and 
making use of three equal masses attached to the ends of strings (the other 
ends of which are connected), as shown in Fig. 4.39. 

2. Heading to zero * 

A particle moves toward x = 0 under the influence of a potential V(x) = 
—A\x\ n , where A > 0 and n > 0. The particle has barely enough energy to 
reach x = 0. For what values of n will it reach x = 0 in a finite time? 

3. Leaving the sphere * 

A small ball rests on top of a fixed frictionless sphere. The ball is given a tiny 
kick and slides downward. At what point does it lose contact with the sphere? 

4. Pulling the pucks ** 

(a) A massless string of length 2£ connects two hockey pucks that lie on 
frictionless ice. A constant horizontal force F is applied to the midpoint 
of the string, perpendicular to it (see Fig. 4.40). How much kinetic 
energy is lost when the pucks collide, assuming they stick together? 

(b) The answer you obtained above should be very clean and nice. Find the 
slick solution (assuming that you solved the problem the “normal” way, 
above) that makes it transparent why the answer is so nice. 




Figure 4.39 



Figure 4.40 


5. V{x) vs. a hill **** 

A bead, under the influence of gravity, slides along a frictionless wire whose 
height is given by the function V(x) (see Fig. 4.41). Find an expression for 
the bead’s horizontal acceleration, x. (It can depend on whatever quantities 
you need it to depend on.) 

You should find that the result is not the same as the x for a particle moving 
in one dimension in the potential mgV(x), in which case x = —gV'. But if you 
grab hold of the wire, is there any way you can move it so that the bead’s x is 
equal to the x = —gV' result due to the one-dimensional potential mgV(x )? 

6. Constant y ** 

A bead, under the influence of gravity, slides along a frictionless wire whose 
height is given by the function y(x). Assume that at position (x,y) = (0,0), 

17 If the three points form a triangle that has an angle greater than 120°, then the string simply 
passes through the point where that angle is. We won’t worry about this case. 



Figure 4.41 
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the wire is vertical and the bead passes this point with a given speed no 
downward. What should the shape of the wire be (that is, what is y as a 
function of x) so that the vertical speed remains no at all times? 

Section 4-2: Small Oscillations 

7 . Small oscillations * 

A particle moves under the influence of the potential V(x) = —Cx n e~ ax . Find 
the frequency of small oscillations around the equilibrium point. 

8. Hanging mass 

The potential for a mass hanging from a spring is V(y ) = ky 2 /2 + mgy, where 
y = 0 corresponds to the position of the spring when nothing is hanging from 
it. Find the frequency of small oscillations around the equilibrium point. 

Section 4-4 : Gravity 

9. Zero force inside a sphere * 

Show that the gravitational force inside a spherical shell is zero by showing 
that the pieces of mass at the ends of the thin cones in Fig. 4.42 give canceling 
forces at point P. 

10. Escape velocity * 

(a) Find the escape velocity (that is, the velocity above which a particle will 
escape to r = oo) for a particle on a spherical planet of radius R and 
mass M. What is the numerical value for the earth? The moon? The 
sun? 

(b) Approximately how small must a spherical planet be in order for a human 
to be able to jump off? Assume a density roughly equal to the earth’s. 

11. Through the hole ** 

(a) A hole of radius R is cut out from an infinite flat sheet of mass density 
a. Let L be the line that is perpendicular to the sheet and that passes 
through the center of the hole. What is the force on a mass m that is 
located on L, at a distance x from the center of the hole? Hint. Consider 
the plane to consist of many concentric rings. 

(b) If a particle is released from rest on L, very close to the center of the 
hole, show that it undergoes oscillatory motion, and find the frequency 
of these oscillations. 

(c) If a particle is released from rest on L, at a distance x from the sheet, 
what is its speed when it passes through the center of the hole? What is 
your answer in the limit x S> R7 
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12. Ratio of potentials ** 

Consider a cube of uniform mass density. Find the ratio of the gravitational 
potential energy of a mass at a corner to that of a mass at the center. Hint: 
There’s a slick way that doesn’t involve any messy integrals. 

Section 4-5: Momentum 

13. Snowball * 

A snowball is thrown against a wall. Where does its momentum go? Where 
does its energy go? 

14. Propelling a car ** 

For some odd reason, you decide to throw baseballs at a car of mass M, which 
is free to move frictionlessly on the ground. You throw the balls at the back of 
the car at speed u, and at a mass rate of a kg/s (assume the rate is continuous, 
for simplicity). If the car starts at rest, find its speed and position as a function 
of time, assuming that the balls bounce elastically directly backwards off the 
back window. 

15. Propelling a car again ** 

Do the previous problem, except now assume that the back window is open, 
so that the balls collect inside the car. 

16. Leaky bucket ** 

At t = 0, a massless bucket contains a mass M of sand. It is connected to 
a wall by a massless spring with constant tension T (that is, independent of 
length). 18 See Fig. 4.43. The ground is frictionless, and the initial distance 
to the wall is L. At later times, let x be the distance from the wall, and let m 
be the mass of sand in the bucket. 

The bucket is released. On its way to the wall, it leaks sand at a rate dm/dx = 
M/L. In other words, the rate is constant with respect to distance, not time. 
Note that dx is negative, so dm is also. 

(a) What is the kinetic energy of the (sand in the) bucket, as a function of 
the distance from the wall? What is its maximum value? 

(b) What is the magnitude of the momentum of the bucket, as a function of 
the distance from the wall? What is its maximum value? 

17. Another leaky bucket *** 

Consider the setup in Problem 16, but now let the sand leak at a rate pro¬ 
portional to the bucket’s acceleration. That is, dm/dt = bx. Note that x is 
negative, so dm is also. 

18 You can construct a constant-tension spring with a regular Hooke’s-law spring in the following 
way. Pick the spring constant to be very small, and stretch the spring a very large distance; have 
it pass through a hole in the wall, with its other end bolted down a large distance to the left of the 
wall. Any changes in the bucket’s position will then yield a negligible change in the spring’s force. 
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Figure 4.44 


(a) Find the mass as a function of time, m(t). 

(b) Find v(t) and x(t) for the times when the bucket contains a nonzero 
amount of sand. Also find v(m) and x(rn). What is the speed right 
before all the sand leaves the bucket (assuming it hasn’t hit the wall 
yet)? 

(c) What is the maximum value of the bucket’s kinetic energy, assuming it 
is achieved before it hits the wall? 

(d) What is the maximum value of the magnitude of the bucket’s momentum, 
assuming it is achieved before it hits the wall? 

(e) For what value of b does the bucket become empty right when it hits the 
wall? 

Section 4 ..7: Collisions 

18. Right angle in billiards * 

A billiard ball collides elastically with an identical stationary one. Use the 
fact that mv 2 / 2 may be written as m(v • v)/2 to show that the angle between 
the resulting trajectories is 90°. 

19. Bouncing and recoiling ** 

A ball of mass m and initial speed no bounces back and forth between a fixed 
wall and a block of mass M (with M 3> m). See Fig. 4.44. M is initially 
at rest. Assume that the ball bounces elastically and instantaneously. The 
coefficient of kinetic friction between the block and the ground is /i. There is 
no friction between the ball and the ground. 

What is the speed of the ball after the nth bounce off the block? How far 
does the block eventually move? How much total time does the block actually 
spend in motion? Work in the approximation where M m, and assume 
that n is large enough so that the block comes to rest by the time the next 
bounce occurs. 

20. Drag force on a sheet ** 

A sheet of mass M moves with speed V through a region of space that contains 
particles of mass m and speed v. There are n of these particles per unit volume. 
The sheet moves in the direction of its normal. Assume m <C M, and assume 
that the particles do not interact with each other. 

(a) If v <C V, what is the drag force per unit area on the sheet? 

(b) If v V, what is the drag force per unit area on the sheet? Assume, for 
simplicity, that the component of every particle’s velocity in the direction 
of the sheet’s motion is exactly ±n/2. 19 

19 In reality, the velocities are randomly distributed, but this idealization actually gives the correct 
answer because the average speed in any direction is \v m \^v/2. The result v% =v 2 /3, which may 
be familiar to you, isn’t relevant here. 
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21. Drag force on a cylinder ** 

A cylinder of mass M and radius R moves with speed V through a region of 
space that contains particles of mass m that are at rest. There are n of these 
particles per unit volume. The cylinder moves in a direction perpendicular to 
its axis. Assume m -C M, and assume that the particles do not interact with 
each other. What is the drag force per unit length on the cylinder? 

22. Basketball and tennis ball ** 

(a) A tennis ball with a small mass m 2 sits on top of a basketball with a 
large mass m\ (see Fig. 4.45). The bottom of the basketball is a height h 
above the ground, and the bottom of the tennis ball is a height h+d above 
the ground. The balls are dropped. To what height does the tennis ball 
bounce? Note: Work in the approximation where m\ is much larger than 
m 2 , and assume that the balls bounce elastically. Also assume, for the 
sake of having a nice clean problem, that the balls are initially separated 
by a small distance, and that the balls bounce instantaneously. 

(b) Now consider n balls, B\, ... , B n . having masses mi, m 2 , ..., m n (with 
mi > m 2 > ■ ■ • > m n ), standing in a vertical stack (see Fig. 4.46). The 
bottom of -Bi is a height h above the ground, and the bottom of B n is 
a height h + i above the ground. The balls are dropped. In terms of n, 
to what height does the top ball bounce? Note: Make assumptions and 
approximations similar to the ones in part (a). 

If h = 1 meter, what is the minimum number of balls needed for the top 
one to bounce to a height of at least 1 kilometer? To reach escape veloc¬ 
ity? Assume that the balls still bounce elastically (which is a bit absurd 
here), and ignore wind resistance, etc., and assume that i is negligible. 


Section 4-8: Inherently inelastic processes 

23. Colliding masses * 

A mass M, initially moving at speed v. collides and sticks to a mass m, initially 
at rest. Assume M m, and work in this approximation. What are the final 
energies of the two masses, and how much energy is lost to heat, in: 

(a) The lab frame? 

(b) The frame in which M is initially at rest? 

24. Pulling a chain ** 

A chain of length L and mass density a lies straight on a frictionless horizontal 
surface. You grab one end and pull it back along itself, in a parallel manner 
(see Fig. 4.47). Assume that you pull it at constant speed v. What force 
must you apply? What is the total work that you do, by the time the chain is 
straightened out? How much energy is lost to heat, if any? 
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25. Pulling a rope ** 

A rope of mass density a lies in a heap on the floor. You grab an end and 
pull horizontally with constant force F. What is the position of the end of the 
rope, as a function of time, while it is unravelling? Assume that the rope is 
greased, so that it has no friction with itself. 

26. Raising the rope ** 

A rope of length L and mass density a lies in a heap on the floor. You grab 
one end of the rope and pull upward with a force such that the rope moves 
at constant speed v. What is the total work you do, by the time the rope is 
completely off the floor? How much energy is lost to heat, if any? Assume 
that the rope is greased, so that it has no friction with itself. 

27. Falling rope *** 

(a) A rope of length L lies in a straight line on a frictionless table, except 
for a very small piece at one end which hangs down through a hole in 
the table. This piece is released, and the rope slides down through the 
hole. What is the speed of the rope at the instant it loses contact with 
the table? 

(b) Answer the same question, but now let the rope lie in a heap on a table, 
except for a very small piece at one end which hangs down through the 
hole. Assume that the rope is greased, so that it has no friction with 
itself. 

28. The raindrop **** 

Assume that a cloud consists of tiny water droplets suspended (uniformly 
distributed, and at rest) in air, and consider a raindrop falling through them. 
What is the acceleration of the raindrop? Assume that the raindrop is initially 
of negligible size and that when it hits a water droplet, the droplet’s water 
gets added to it. Also, assume that the raindrop is spherical at all times. 
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4.11 Solutions 


1. Minimum length 

Cut three holes in the table at the locations of the three given points. Drop the masses 
through the holes, and let the system reach its equilibrium position. The equilibrium 
position is the one with the lowest potential energy of the masses, that is, the one 
with the most string hanging below the table. In other words, it is the one with the 
least string lying on the table. This is the desired minimum-length configuration. 
What are the angles at the vertex of the string? The tensions in all three strings are 
equal to mg. The vertex of the string is in equilibrium, so the net force on it must 
be zero. This implies that each string must bisect the angle formed by the other two. 
Therefore, the angles between the strings must all be 120°. 

2. Heading to zero 

Write F = ma as mvdv/dx = —V'(x). Separating variables and integrating gives 
mv 2 /2 = C — V(x), where C is a constant of integration. The given information tells 
us that v = 0 when x = 0. Therefore C = 0. C is simply the total energy of the 
particle. Writing v as dx/dt and separating variables again gives 


dx 




(4.96) 


Assume that the particle starts at position xq > 0. Let T be the time to reach the 
origin. Integrating the previous equation from x 0 to x = 0 gives 



(4.97) 


The integral on the left is finite only if n/2 < 1. Therefore, the condition that T is 
finite is 

n < 2. (4.98) 


Remark: The particle will take a finite time to reach the top of a triangle or the curve 
—Ax 3 / 2 . But it will take an infinite time to reach the top of a parabola, cubic, etc. A circle 
looks like a parabola at the top, so T is infinite in that case also. In fact, any nice polynomial 
function V(x) will require an infinite T to reach a local maximum, because the Taylor series 
starts at order (at least) two around an extremum. & 

3. Leaving the sphere 

First Solution: Let R be the radius of the sphere, and let 9 be the angle of the 
ball, measured from the top of the sphere. The radial F = ma equation is 


mg cos 9 — N = 


R ’ 


(4.99) 


where N is the normal force. The ball loses contact with the sphere when the normal 
force becomes zero (that is, when the normal component of gravity is not large enough 
to account for the centripetal acceleration of the ball). Therefore, the ball loses contact 
when 


—=— mg cos 9. 
R 


(4.100) 
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But conservation of energy gives mv 2 / 2 = mgR( 1—cos 9). Hence, v = y/2gli(l — cos 9 )- 
Plugging this into eq. (4.100), we see that the ball leaves the sphere when 

cos 0 = . (4.101) 

This corresponds to 9 ss 48.2°. 

Second Solution: Let’s assume that the ball always stays in contact with the sphere, 
and then we’ll find the point where the horizontal component of v starts to decrease 
(which it of course can’t do, because the normal force doesn’t have a “backwards” 
component). From above, the horizontal component of v is 

v x = v cos 9 = \/2gR{\ — cos 9) cos 9. (4.102) 

Taking the derivative of this, we find that the maximum occurs when cos 9 = 2/3. So 
this is where v x would start to decrease if the ball were constrained to remain on the 
sphere. But since there is no such constraining force available, the ball loses contact 
when cos 9 = 2/3. 

4. Pulling the pucks 

(a) Let 9 be defined as in Fig. 4.48. Then the tension in the string is T = F/(2 cos 9), 
because the force on the massless kink in the string must be zero. Consider the 
“top” puck. The component of the tension in the y-direction is —T sin 0 = 
—F tan 9/2. The work done on the puck by this component is therefore 

f° -Ftanfl , 

w v = J -2- dy 

- 

- 

FI cos 6> |° 

~~2“ U 

FI 

= T • (4-103) 

By the work-energy theorem (or equivalently, by separating variables and in¬ 
tegrating F y = mvydvy/dy), this work equals mv 2 / 2. The kinetic energy lost 
when the two pucks stick together is twice this quantity (v x doesn’t change 
during the collision). Therefore, 

KE loss = Ft (4.104) 

(b) Consider two systems, A and B (see Fig. 4.49). A is the original setup, while B 
starts with 9 already at zero. Let the pucks in both systems start simultaneously 
at x = 0. As the force F is applied, all four pucks will have the same x(t), because 
the same force in the x-direction, namely F/2, is being applied to every puck at 
all times. After the collision, both systems will therefore look exactly the same. 
Let the collision of the pucks occur at x = d. At this point, F(d + t) work 
has been done on system A, because the center of the string (where the force 
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is applied) ends up moving a distance l more than the masses. However, only 
Fd work has been done on system B. Since both systems have the same kinetic 
energy after the collision, the extra Ft work done on system A must be what is 
lost in the collision. 

Remark: The reasoning in this second solution makes it clear that this FI result 
holds even if we have many masses distributed along the string, or if we have a rope 
with a continuous mass distribution (so that the rope flops down, as in Fig. 4.50). 
The only requirement is that the mass be symmetrically distributed around the mid¬ 
point. Analyzing this more general setup along the lines of the first solution would be 
extremely tedious, to say the least. & 


5. V{x) vs. a hill 

First solution: Consider the normal force, N, acting on the bead at a given point. 
Let 0 be the angle that the tangent to V(x) makes with the horizontal, as shown in 
Fig. 4.51. The horizontal F = ma equation is 


(4.105) 


The vertical F = ma equation is 



Figure 4.50 



Figure 4.51 


N cos 6 — mg = my =>■ N cos 9 = mg + my. 
Dividing eq. (4.105) by eq. (4.106) gives 


(4.106) 


But tan# = V'{x). Therefore, 


x = ~{g + y)V'. 


(4.107) 


(4.108) 


We see that this is not equal to —gV'. In fact, there is in general no way to construct 
a curve with height y(x) that gives the same horizontal motion that a 1-D potential 
V(x) gives, for all initial conditions. We would need (g + y)y' = V, for all x. But 
at a given x, the quantities V' and y' are fixed, whereas y depends on the initial 
conditions. For example, if there is a bend in the wire, then y will be large if y is 
large. And y depends (in general) on how far the bead has fallen. 

Eq. (4.108) holds the key to constructing a situation that does give the x = —gV' 
result for a 1-D potential V(x). All we have to do is get rid of the y term. So here’s 
what we do. We grab our y = V (x) wire and then move it up and/or down in precisely 
the manner that makes the bead stay at the same height with respect to the ground. 
(Actually, constant vertical speed would be good enough.) This will make the y term 
vanish, as desired. Note that the vertical movement of the curve doesn’t change the 
slope, V', at a given value of x. 

Remark: There is one case where x is (approximately) equal to —gV', even when the 
wire remains stationary. In the case of small oscillations of the bead near a minimum of 
V(x), y is small compared to g. Hence, eq. (4.108) shows that x is approximately equal to 
—gV'. Therefore, for small oscillations, it is reasonable to model a particle in a 1-D potential 
mgV(x) as a particle sliding in a valley whose height is given by y = V{x). & 
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Second solution: The component of gravity along the wire is what causes the 
change in speed of the bead. That is, 

-gsine= ( ^, (4.109) 

where 9 is given by 

, V' 1 

tan 9=V(x) => sin 9 = , cos 9 = . (4.110) 

v ' Vi - V' z \ I - v rl y 

Wo are, however, not concerned with the rate of change of v, but rather with the rate 
of change of x. In view of this, let us write v in terms of x. Since x = v cos 9, we have 
v = x/ cos, 9 = xVi + V n . (Dots denote d/dt. Primes denote d/dx.) Therefore, eq. 
(4.109) becomes 



Hence, x is given by 

.. _ -gV’ x.V'{dVfdt) 

X ~ 1 + V' 2 1 + V' 2 

We’ll simplify this in a moment, but first a remark. 


(4.111) 


(4.112) 


Remark: A common incorrect solution to this problem is the following. The acceleration 
along the curve is g sin# = —g{V'/Vl % V ’ 2 ). Calculating the horizontal component of this 
acceleration brings in a factor of cos# = 1/di + V ' 2 . Therefore, we might think that 


-gV 

' 1 + V' 2 


(incorrect). (4.113) 

e have missed the second term in eq. (4.112). Where is the mistake? The e 


But v 

that we forgot to take into account the possible change in the curve’s slope. (Eq. (4.113) 
is true for straight lines.) We addressed only the acceleration due to a change in speed. We 
forgot to consider the acceleration due to a change in the direction of motion. (The term we 
missed is the one with dV'/dt.) Intuitively, if we have sharp enough bend in the wire, then 
x can change at an arbitrarily large rate, even if v is roughly constant. In view of this fact, 
eq. (4.113) is definitely incorrect, because it is bounded (by g/ 2, in fact). & 

To simplify eq. (4.112), note that V' = dV/dx = (dV/ dt) / (dx / dt) = Vfx. Therefore, 

., r ,dV' . V ,d(v 

xV -TT = xV — — 

dt dt 1 ~ 


(4.114) 


= yv-v'x^- 
= v'V - V ,2 x. 

Substituting this into eq. (4.112), we obtain 

x = -(g + V)V', 


(4.115) 
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in agreement with eq. (4.108), since y(x) = V{x). 

Eq. (4.115) is valid for a curve V(x) that remains fixed. If we grab the wire and start 
moving it up and down, then the above solution is invalid, because the starting point, 
eq. (4.109), rests on the assumption that gravity is the only force that does work on 
the bead. But if we move the wire, then the normal force also does work. 

It turns out that for a moving wire, we simply need to replace the V in eq. (4.115) by 
y. This can be seen by looking at things in the (instantaneously inertial) vertically- 
moving frame in which the wire is at rest. In this new frame, the normal force does 
no work, so the above solution is valid. And in this new frame, y = V. Eq. (4.115) 
therefore becomes x = —(g + y)V'. Shifting back to the lab frame (which moves at 
constant speed with respect to the instantaneous inertial frame of the wire) doesn’t 
change y. We thus arrive at eq. (4.108), valid for a stationary or vertically moving 


6. Constant y 

By conservation of energy, the bead’s speed at any time is given by (note that y is 
negative here) 

j™ 2 + mgy = ^touq => v = yjv%- 2 gy . (4.116) 

The vertical component of the speed is y = v sin 9, where tan 6 = y' = dy/dx is the 
slope of the wire. Hence, sin 6 = y 1 /y/l. + |p. The requirement y = —no, which is 
equivalent to usin0 = —no, may therefore be written as 



(4.117) 
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8. Hanging mass 

We will calculate the equilibrium point yo, and then use u> = \fV"{yo)/m. The 
derivative of V is 

V'(y) = ky + mg. (4.123) 

Therefore, V’(y) = 0 when y = —mg/k = yo. The second derivative of V is 


V"(y) = k. 



(4.124) 

(4.125) 



Figure 4.52 


Remark: This is independent of yo, which is what we expect. The only effect of gravity is 
to change the equilibrium position. If y r is the position relative to yo (so that y = y 0 +y r ), 
then the total force as a function of y r is 

F (y r ) = -k(y 0 + y r ) - mg = -k + Vr'j - mg = - ky r , (4.126) 

so it still looks like a regular spring. (This only works, of course, because the spring force is 
linear.) Equivalently, we can complete the square and write the given potential as 


v{y) = \{y 


mg\ 2 
k ) 


m 2 g 2 

2k 


(4.127) 


The additive constant — m 2 g 2 /2k is irrelevant in determining the curvature (that is, the 
second derivative) of the parabola at the minimum, as is the shift in the origin of y by 
—mg/k. We basically have a mass on a spring in zero gravity, in which case the frequency 
is simply ^Jk/m. * 

9. Zero force inside a sphere 

Let a be the distance from P to piece A, and let b be the distance from P to piece B 
(see Fig. 4.52). Draw the “perpendicular” bases of the cones, and call them A' and 
B'. The ratio of the areas of A’ and B’ is a 2 /b 2 . 

The key point here is that the angle between the planes of A and A! is the same as 
the angle between B and £?'; this is true because the chord between A and B meets 
the circle at equal angles at its ends. So the ratio of the areas of A and B is also 
a 2 /b 2 . But the gravitational force decreases like 1/r 2 , and this effect exactly cancels 
the a 2 /b 2 ratio of the areas. Therefore, the forces at P due to A and B (which can 
be treated like point masses, because the cones are assumed to be thin) are equal in 
magnitude (and opposite in direction, of course). 


10. Escape velocity 


(a) The cutoff case is where the particle barely makes it to infinity, that is, where 
its speed is zero at infinity. Conservation of energy for this situation gives 


GMm 

R 


0 + 0 . 


(4.128) 


In other words, the initial kinetic energy, mv 2 sc / 2, must account for the gain in 
potential energy, GMm/R. Therefore, 



(4.129) 
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In terms of the acceleration, g = GM/R 2 , at the surface of a planet, we can 
write v esc as v esc = \/2gR. Using M = 4npR 3 /3, we can also write it as 
v esc = y/8nGR?p/3. So for a given density p, v esc grows like R. 

Using the values of g given in Appendix J, we have: 

For the earth, v esc = \/2gR w i/2/9.8 m/js£}(6.4 ■ 10 6 "iri) jw 11,200 m/s. 

For the moon, v esc = y/2gR « ^2(1.6 m/s 2 ') (1.7 ■ 10®m)» 2,300 m/s. 

For the sun, v esc = y/2gR ^2(270 m/s 2 )(-7.0 • 10® m) ~ 620,000 m/s. 

Remark: Another reasonable question to ask is: what is the escape velocity from the 
sun for an object located where the earth is? (But imagine that the earth isn’t there.) 
The answer is yj2 GM s /Re,s, where R E ,s is the earth-sun distance. Numerically, this 
is ^2(6.67 • 10- 11 )(2 • 10 30 )/(1.5 • 10 11 ) w 42,000 m/s. * 

(b) To get a rough answer, let’s assume that the initial speed of a person’s jump on 
the small planet is the same as it is on the earth. This probably isn’t quite true, 
but it’s close enough for the purposes here. A good jump on the earth is about a 
meter. For this jump, mv 2 /2 = mg( lm). Therefore, v = ^2^(1 m) &s \/2bm/s. 
So we want \/20 = ^8irGR 2 p/3. Using p w 5500 kg/m 3 , we find R « 2.5 km. 
On such a planet, you should tread lightly. 


11. Through the hole 


(a) By symmetry, only the component of the gravitational force perpendicular to 
the plane will survive. A piece of mass dm at radius r on the plane will provide 
a force equal to Gm(dm)/(r 2 + x 2 ). To obtain t he component perpendicular to 
the plane, we must multiply this by x/vr 2 + x 2 . Slicing the plane up into rings 
with mass dm = (27r rdr)a, we find that the total force is 


F(x) 



Gm(2nra dr)x 


(b) If x <C R, then eq. 4.130 gives 


(4.130) 


F(*)* 


F = ma then becomes 




The frequency of small oscillations is therefore 


(4.131) 

(4.132) 


(4.133) 


Remark: For everyday values of R, this is a rather small number because G is so 
small. Let’s determine the rough size. If the sheet has thickness d, a nd if it is m ade of 
a material with density p (per volume), then cr = pd. Hence, ui s= \J2-k pd,G/R. 
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In the above analysis, we assumed that the sheet was infinitely thin. In practice, we 
need d to be much smaller than the amplitude of the motion. But this amplitude must 
be much smaller than R in order for our approximation to hold. So we conclude that 
d <C R. To get a rough upper bound on w, let’s pick d/R, M^'d/lO. And let’s make 
our sheet out of gold (with p « 2 • 10 4 kg/m 3 ). We then find u) « 1 • 10 _3 s _1 , which 
corresponds to an oscillation about every 100 minutes. 

For the analogous system consisting of electrical charges, the frequency is much larger, 
because the electrical force is so much stronger than the gravitational force. 4 
(c) Integrating the force in eq. 4.130 to obtain the potential energy (relative to the 
center of the hole) gives 


V(x) = 

Jo Jo v R 2 + x 2 

= 2naGm\/ R 2 + x 2 1 = 2TraGm(\/ R 2 + x 2 — R) (4.134) 

By conservation of energy, the speed at the center of the hole is given by mv 2 / 2 = 
V(x). Therefore, 

v = 2 

For large x this reduces to v = 2y / naGx. 


(4.135) 


Remark: You can also obtain this last result by noting that for large x, the force 
in eq. (4.130) reduces to F = —2iraGm. This is constant, so it’s basically just like 
a gravitational force F ?g mg' , where g = 2ncrG. But we know that in this familiar 
case, v = s/Wh — yj2(2naG)x , as above. * 


12. Ratio of potentials 

Let p be the mass density of the cube. Let V) cor be the potential energy of a mass m 
at the corner of a cube of side £, and let V) cerl be the potential energy of a mass rn at 
the center of a cube of side t. By dimensional analysis, 


y/ or 


G(pP)m 2 
£ ^ • 


(4.136) 


Therefore, 20 


(4.137) 


But a cube of side £ can be built from eight cubes of side £/2. So by superposition, 
we have 


(4.138) 


because the center of the larger cube lies at a corner of the 
Therefore, 


_ 1 

VT~W% fr¬ 


eight smaller cubes. 


(4.139) 


20 In other words, imagine expanding a cube of side £/2 to one of side l. If we consider corre¬ 
sponding pieces of the two cubes, then the larger piece has 2 3 = 8 times the mass of the smaller. 
But corresponding distances are twice as big in the large cube as in the small cube. Therefore, the 
larger piece contributes 8/2 = 4 times as much to V[ OT as the smaller piece contributes to Vffi- 
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13. Snowball 

All of the snowball’s momentum goes into the earth, which then translates (and 
rotates) a tiny bit faster (or slower, depending on which way the snowball was thrown). 
What about the energy? Let M be the mass of the earth, and let V be the final 
speed of the earth, with respect to the original rest frame of the earth. Then m ^ M 
implies V sa mv/M. The kinetic energy of the earth is therefore 



There is also a rotational kinetic-energy term of the same order of magnitude, but 
that doesn’t matter. Wee see that essentially none of the snowball’s energy goes into 
the earth. It therefore must all go into the form of heat, which melts some of the 
snow. This is a general result for a small object hitting a large object: The large 
object picks up essentially all of the momentum but essentially none of the energy. 

14. Propelling a car 

Let the speed of the car be v(t). Consider the collision of a ball of mass dm with 
the car. In the instantaneous rest frame of the car, the speed of the ball is u — v. In 
this frame, the ball reverses velocity when it bounces, so its change in momentum is 
—2 (u — v ) dm. This is also the change in momentum in the lab frame, because the 
two frames are related by a given speed at any instant. Therefore, in the lab frame 
the car gains a momentum of 2 (u — v ) dm from each ball that hits it. The rate of 
change in momentum of the car (that is, the force) is thus 


dp 

dt 


= 2a'(u — v), 


(4.141) 


where a' = dm/dt is the rate at which mass hits the car. a' is related to the given 
a by a' = a(u — v)/u, because although you throw the balls at speed u, the relative 
speed of the balls and the car is only (u — v). We therefore have 



2(it — v) 2 c 


= %-f* 

Mu J 0 
_ 2 at 

Mu 

(Mh 

1 + AT 


(4.142) 


Note that v —> u as t —> oo, as it should. Integrating this speed to obtain the position 
gives 

x (t) = ut - ^ln(l+ . (4.143) 

We see that even though the speed approaches u, the car will eventually be an arbi¬ 
trarily large distance behind a ball with constant speed u (for example, pretend that 
your first ball misses the car and continues forward at speed u). 
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15. Propelling a car again 

We can carry over some of the results from the previous problem. The only change in 
the calculation of the force on the car is that since the balls don’t bounce backwards, 
we don’t pick up the factor of 2 in eq. (4.141). The force on the car is therefore 


dv (u — v) 2 a 


(4.144) 


where m(t) is the mass of the car-plus-contents, as a function of time. The main 
difference between this problem and the previous one is that this mass m changes 
because the balls are collecting inside the car. As in the previous problem, the rate 
at which the balls enter the car is a' = a(u — v ) /it. Therefore, 


dm _ (u — v)a 
dt u 


(4.145) 


We must now solve the two preceding differential equations. Dividing eq. (4.144) by 
eq. (4.145), and separating variables, gives 21 


Jo U — V J M 



(4.147) 


Note that to —> oo as v —> it, as it should. Substituting this value of to into either eq. 
(4.144) or eq. (4.145) gives 


Note that v 
gives 


v dv 
(■u - v) 3 


2(u - v) 2 2u 2 


T adt 

Jo M v? 

at 

Mu 2 



(4.148) 


u as t —> oo, as it should. Integrating this speed to obtain the position 


x(t ) = ut 


Mu f 2at 


(4.149) 


Remark: For a given t, the v(t) in eq. (4.148) is smaller than the v(t) in eq. (4.142). This 
makes sense, because the balls have less of an effect on v(t), because now (1) they don’t 
bounce back, and (2) the mass of the car-plus-contents is larger. £ 

16. Leaky bucket 

(a) First Solution: The initial position is x = L. The given rate of leaking implies 
that the mass of the bucket at position x is to = M(x/L). Therefore, F = ma 

21 We can also quickly derive this equation by writing down conservation of momentum for the 
time interval when a mass dm enters the car: 

dmu + mv = (m + dm) (v + dv) . (4.146) 

This yields eq. (4.147). But we still need to use one of eqs. (4.144) and eq. (4.145). 
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gives — T = ( Mx/L)x. Writing the acceleration as vdv/dx, and separating 
variables and integrating, gives 


-r-= r«d 

Jl x Jo 


y at position x is therefore 


In terms of the fraction z = x/L, we have E = —TLzhiz. Setting dE/dz 
to find the maximum gives 


Note that both E max and its locatio 


e independent of M. 


Remark: We began this solution by writing down /•" — ma, where m is the mass 
of the bucket. You may be wondering why we didn’t use F ~ dp/dt, where p is 
the momentum of the bucket. This would certainly give a different result, because 
dp/dt = d(mv)/dt = ma + ( dm/dt)v . We used F = ma because at any instant, the 
mass m is what is being accelerated by the force F . 

If you want, you can imagine the process occurring in discrete steps: The force pulls 
on the mass for a short period of time, then a little piece falls off. Then the force pulls 
again on the new mass, then another little piece falls off. And so on. In this scenario, 
it is clear that F = ma is the appropriate formula, because it holds for each step in 
the process. 

It is indeed true that F = dp/dt, if you let F be total force in the problem, and let 
p be the total momentum. The tension T is the only horizontal force in the problem, 
because we’ve assumed the ground to be frictionless. However, the total momentum 
consists of both the sand in the bucket and the sand that has leaked out and is sliding 
along on the ground. If we use F = dp/dt, where p is the total momentum, then we 
obtain 

_ dpbucket dpieaked _ ( dm \ ( dm\ _ . . 


as expected. (Note that — dm/dt is a positive quantity.) See Appendix E for further 
discussion on the uses of F = ma and F = dp/dt. & 


Second solution: Consider a small time interval during which the bucket 
moves from x to x + dx (where dx is negative). The bucket’s kinetic energy 
changes by (—T) dx (this is a positive quantity) due to the work done by the 
spring, and also changes by a fraction dx/x (this is a negative quantity) due to 
the leaking. Therefore, dE = —T dx + E dx/x, or 


1 solving this differential equation, it is convenient to introduce the variable 
= E/x. Then E' = xy' + y, where a prime denotes differentiation with respect 
) x. Eq. (4.154) then becomes xy' = —T, which gives 


rE/x rx 

/ dy=-T 

Jo Jl 


as in the first solution. 
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(b) From eq. (4.150), the speed is v = y/2TL/M \J— In z, where 2 = x/L. There¬ 
fore, the magnitude of the momentum is 


Setting dp/dz = 0 to find the maximum gives 


We see that the location of p max is independent of M, T, and L, but its value 


Remark: B max occurs at a later time (that is, closer to the wall) than p max . This 
is because v matters more in E = mv 2 /2 than it does in p = mv. As far as E is 
concerned, it is beneficial for the bucket to lose a little more mass if it means being 
able to pick up a little more speed (up to a certain point). & 

Another leaky bucket 

(a) F = ma says that — T = mx. Combining this with the given dm/dt = bx yields 
mdm = —bT dt. Integration then gives m 2 /2 = C — bTt. But m = M when 
t = 0, so we have C = M 2 / 2. Therefore, 

m(t) = VM 2 - 2bTt. (4.158) 

This holds for t < M 2 /2bT, provided that the bucket hasn’t hit the wall yet. 

(b) The given equation dm/dt = bx = bdv/dt integrates to v = m/b+D. But v = 0 
when m = M, so we have D = —M/b. Therefore, 


At the instant right before all the sand leaves the bucket, we have m = 0. 
Therefore, v = —M/b. 

Integrating v(t) to obtain x(t), we find 

. , -(M 2 - 2bTt) 3 / 2 M T M 3 . 

x (t)~ -- T t + L + Wr ’ (4 ‘ 160) 

where the constant of integration has been chosen to satisfy x = L when t = 0. 
Solving for t in terms of to from eq. (4.158), substituting the result into eq. 
(4.160), and simplifying, gives 


(M — m) 2 (M + 2m) 


(c) Using eq. (4.159), the kinetic energy i 


2 = m(m -M) 2 . 


Taking the derivative dE/dm to find the maximum, we obtai 


(4.163) 
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(d) Using eq. (4.159), the momentum is 


1 


Taking the derivative to find the maximum magnitude, we obtain 

M . . M 2 

m=— => |p| max = - 


4 b ' 


(e) We want x = 0 when m = 0. Eq. (4.161) then gives 

„ r M 3 / 


(4.164) 


(4.165) 


(4.166) 


18. Right angle in billiards 

Let v be the initial velocity, and let vi and V 2 be the final velocities. Conservation 
of momentum and energy give 

mV = TOVi + TOV 2, 

^m(v-v) = ^m(v 1 -v 1 ) + ^rn(v 2 -v 2 ). (4.167) 

Substituting the v from the first equation into the second, and using (vi + V 2 ) • (vi + 
V 2 ) = vi • vi + 2vi • v 2 + v 2 • v 2 , gives 


vi • v 2 = 0. (4.168) 

In other words, the angle between vj and V 2 is 90°. (Or vi = 0, which means the 
incoming mass stops because the collision is head-on. Or V 2 = 0, which means the 
masses miss each other.) 

19. Bouncing and recoiling 

Let Vi be the speed of the ball after the ith bounce, and let V) be the speed of the 
block right after the ith bounce. Then conservation of momentum gives 

mvi = MV i+1 - mv i+1 . (4.169) 

But Theorem 4.3 says that Vi = V)+i + «i+i- Solving this system of two linear 
equations gives 

Vi+1 = ^ + rn* = ^l + ~ ( 1 “ 2e )^> and ^+1 ~ 2ev i > (4-170) 

where e = m/M -C 1. This expression for Vj + i implies that the speed of the ball after 
the nth bounce is 

v n = (1 - 2e) n v 0 . (4.171) 

The total distance traveled by the block can be obtained by looking at the work done 
by friction. Eventually, the ball has negligible energy, so all of its initial kinetic energy 
goes into heat from friction. Therefore, mv §/2 = Ffd = (/ nMg)d , which gives 


(4.172) 
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To find the total time, we can add up the times, t n , the block moves after each bounce. 
Since force times time equals the change in momentum, we have Fft n = MV n , and 
so ( p,Mg)t n = M(2ev n -i) = 2Me(l — 2e) n ~ 1 vo. Therefore, 



Remarks: This t is independent of the masses. Note that it is much larger than the 
result obtained in the case where the ball sticks to the block on the first hit, in which case 
the answer is mvo/(pMg). The total time is proportional to the total momentum that the 
block picks up, and the present answer is larger because the wall keeps transferring positive 
momentum to the ball, which then transfers it to the block. 

The calculation of d above can also be done by adding up the geometric series of the distances 
moved after each bounce. Note that d is the same as it would be in the case where the ball 
sticks to the block on the first hit. The total distance is proportional to the total energy that 
the block picks up, and in both cases the total energy given to the block is mv% /2. The wall 
(which is attached to the very massive earth) transfers essentially no energy to the ball. Jit 


20. Drag force on a sheet 

(a) We will set v = 0 here. When the sheet hits a particle, the particle acquires a 
speed of essentially 2V. This follows from Theorem 4.3, or by working in the 
frame of the heavy sheet. The momentum of the particle is then 2 mV. In time 
t, the sheet sweeps through a volume AVt, where A is the area of the sheet. 
Therefore, in time t, the sheet hits AVtn particles. The sheet therefore loses 
momentum at a rate of dP/dt = (AVn)(2mV). But F = dP/dt, so the force 
per unit area is 

j = 2 nmV 2 = 2 pV 2 , (4.174) 

where p is the mass density of the particles. We see that the force depends 
quadratically on V. 

(b) If v » V, the particles now hit the sheet on both sides. Note that we can’t set V 
exactly equal to zero here, because we would obtain a result of zero and miss the 
lowest-order effect. In solving this problem, we need only consider the particles’ 
motions in the direction of the sheet’s motion. As stated in the problem, we will 
assume that all velocities in this direction are equal to ±v/2. 

Consider a particle in front of the sheet, moving backward toward the sheet. 
The relative speed between the particle and the sheet is v/2 + V. This relative 
speed simply reverses direction during the collision, so the change in momentum 
of this particle is 2m(v/2 + V). We have used the fact that the speed of the 
heavy sheet is essentially unaffected by the collision. The rate at which particles 
collide with the sheet is A(v/2 + V) (n/2), from the reasoning in part (a). The 
n/2 factor comes from the fact that half of the particles move toward the sheet, 
and half move away from it. 

Now consider a particle in back of the sheet, moving forward toward the sheet. 
The relative speed between the particle and the sheet is v/2 — V. This relative 
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speed simply reverses direction during the collision, so the change in momentum 
of this particle is —2m(v/2 — V). And the rate at which particles collide with 
the sheet is A(v/2 — V)(n/2). 

Therefore, the force per unit area on the sheet is 

F _ 1 dP 

A A dt 

= Q(«/ 2 + V)) (MV2 + V)) + Q(i>/2 - V)) ( - 2m(v/2 - V )) 

= 4nm(v/2)V 

m 2 pvV. (4.175) 

We see that the force depends linearly on V. The fact that it agrees with the 
result in part (a) in the case of v = V is coincidence. Neither result is valid 
when v = V. 


21. Drag force on a cylinder 

Consider a particle that makes contact with the cylinder at an angle 6 with respect 
to the line of motion. In the frame of the heavy cylinder (see Fig. 4.53), the particle 
comes in with velocity —V and then bounces off with a horizontal velocity component 
of V cos 26. So in this frame (and therefore also in the lab frame, because the heavy 
cylinder is essentially unaffected by the collision), the particle increases its horizontal 
momentum by mV( 1 + cos 26). 

The area on the cylinder that lies between 6 and 6 + d6 sweeps out volume at a rate 
(Rd6 cos 6)V£, where £ is the length of the cylinder. The cos 6 factor here gives the 
projection orthogonal to the direction of motion. 

The force per unit length on the cylinder (that is, the rate of change of momentum, 
per unit length) is therefore 


F 

1 


J (n(Rd6 cos 6)V^(mV(l + cos 26)^ 
6(1 — sin 2 6) d6 


-*li 

2 nmRV 2 J 


(4.176) 


Note that the average force per cross-sectional area, F/(2R£), equals (4/3 )pV 2 . This 
is smaller than the result for the sheet in the previous problem, as it should be, 
because the particles bounce off somewhat sideways in the cylinder case. 

22. Basketball and tennis ball 


(a) Right before the basketball hits the ground, both balls move downward with 
speed (using mv 2 /2 = mgh) 

v = s/2gh. (4.177) 

Right after the basketball bounces off the ground, it moves upward with speed 
v, while the tennis ball still moves downward with speed v. The relative speed 
is therefore 2v. After the balls bounce off each other, the relative speed is still 



cylinder frame 


Figure 4.53 
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2v. This follows from Theorem 4.3, or by working in the frame of the heavy 
basketball. Since the upward speed of the basketball essentially stays equal to 
v, the upward speed of the tennis ball is 2v + v = 3v. By conservation of energy, 
it will therefore rise to a height of H = d + (3v) 2 /(2g). But v 2 = 2 gh, so we 
have 

H = d+9h. (4.178) 

(b) Right before B t hits the ground, all of the balls move downward with speed 
v = s/2gh. 

We will inductively determine the speed of each ball after it bounces off the one 
below it. If Bi achieves a speed of v, after bouncing off B,_i, then what is the 
speed of B i+1 after it bounces off B, ‘! The relative speed of B l+[ and B, (right 
before they bounce) is v + Vi. This is also the relative speed after they bounce. 
Therefore, since B, is still moving upward at essentially speed v.,, we see that 
the final upward speed of B i+1 equals (v + v/) + n*. Thus, 

v i+1 = 2vi + v. (4.179) 

Since v\ = v, we obtain v -2 = 3v (in agreement with part (a)), and then v% = 7v, 
and then = 15u, etc. In general, 


v n = (2 n - l)u, (4.180) 

which is easily seen to satisfy eq. (4.179), with the initial value v\ = v. 

From conservation of energy, B n will bounce to a height of 

H = I+ ^ 2n ~ 1 ^ =£+(2 n -l ) 2 h. (4.181) 

If h is 1 meter, and we want this height to equal 1000 meters, then (assuming £ 
is not very large) we need 2" — 1 > \/l000. Five balls won’t quite do the trick, 
but six will, and in this case the height is almost four kilometers. 

Escape velocity from the earth (which is v esc = \/2gR w 11, 200 m/s) is reached 
when 

Vn > V esc => (2" - l)y/2~gh > sj2gR => n > ln 2 + l^J . (4.182) 

With R = 6.4 • 10 6 m and h = lm, we find n > 12. Of course, the elasticity 
assumption is absurd in this case, as is the notion that one can find 12 balls 
with the property that m\ m2 TO12. 

23. Colliding masses 

(a) By conservation of momentum, the final speed of the combined masses is Mv/(M+ 
m) w (1 — m/M)v, plus higher-order corrections. The final energies are therefore 

1 2 

-mv , 

^Mv 2 - mv 2 . (4.183) 

These energies add up to Mv 2 /2 — mv 2 / 2, which is mv 2 /2 less than the initial 
energy of mass M, namely Mv 2 /2. Therefore, mv 2 /2 is lost to heat. 


p 1 Y m \ 2 

Em = -TO 1- 

m 2 V MJ 
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(b) In this frame, mass m has initial speed v, so its initial energy is E, = mv 2 /2. By 
conservation of momentum, the final speed of the combined masses is mu/(M-j» 
to) w ( m/M)v , plus higher-order corrections. The final energies are therefore 

0, 

0. (4.184) 

This negligible final energy is mv 2 / 2 less than £). Therefore, mv 2 /2 is lost to 
heat, in agreement with part (a). 

24. Pulling a chain 

Let x be the distance your hand has moved. Then x/2 is the length of the moving part 
of the chain, because the chain gets “doubled up”. The momentum of this moving part 
is therefore p = ( ax/2)x. The force that your hand applies is found from F = dp/dt, 
which gives F = ( cr/2){x 2 + xx). But since v is constant, the x term vanishes. The 
change in momentum here is due simply to additional mass acquiring speed v, and 
not due to any increase in speed of the part already moving. Hence, 

F=^f, (4.185) 

which is constant. Your hand applies this force over a total distance 2 L, so the total 
work you do is 

F(2L) = aLv 2 . (4.186) 

The mass of the chain is aL, so its final kinetic energy is (itL)v 2 /2. This is only half 
of the work you do. Therefore, an energy of uLv 2 /2 is lost to heat. 

Each atom in the chain goes abruptly from rest to speed v, and there is no way to 
avoid heat loss in such a process. This is clear when viewed in the reference frame of 
your hand. In this frame, the chain initially moves at speed v and eventually comes 
to rest, piece by piece. All of its initial kinetic energy, (aL)v 2 / 2, goes into heat. 

25. Pulling a rope 

Let x be the position of the end of the rope. The momentum of the rope is then 
p = ( ax)x. F = dp/dt gives (using the fact that F is constant) Ft = p, so we have 
Ft = (ax)x. Separating variables and integrating yields 

/ Ftdt 
Jo 
Ft 2 
2 

tyfYJ/j. (4.187) 

The position therefore grows linearly with time. In other words, the speed is constant, 
and it equals *jF/o. 

Remark: Realistically, when you grab the rope, there is some small initial value of x (call it 
e). The dx integral above now starts at e instead of 0, so x takes the form, x = \JFt 2 /a + e 2 . 
If e is very small, the speed very quickly approaches yjF/o. Even if e is not small, the position 
becomes arbitrarily close to t^/F/a, as t becomes large. The “head-start” of e will therefore 
not help you in the long run. ft 
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26. Raising the rope 

Let y be the height of the top of the rope. Let F(y) be the desired force applied 
by your hand. Consider the moving part of the rope. The net force on this part is 
F — ( cry)g , with upward taken to be positive. The momentum is (ay)y. Equating the 
net force on the moving part with the rate of change in momentum gives 22 


F - ayg 


d(ayy) 

dt 

ayy + ay 2 . 


(4.188) 


But y = 0, and y = v. Therefore, 


F = ayg 


(4.189) 


The work that you do is the integral of this force, from y = 0 to y = L. Since v is 
constant, we have 


W = J {vy9 + vv 2 )dy= ( ^Y?-+aLv 2 . (4.190) 

The final potential energy of the rope is (aL)g(L/ 2), because the center of mass is 
raised by distance L/2. This is the first term in eq. (4.190). The final kinetic energy 
is (aL)v 2 / 2. This accounts for half of the last term. The missing energy, (aL')v' 2 /2, 
is converted into heat. 

27. Falling rope 

(a) First Solution: Let cr be the mass density of the rope. From conservation of 
energy, we know that the rope’s final kinetic energy, which is (aL)v 2 / 2, equals 
the loss in potential energy. This loss equals ( aL)(L/2)g , because the center of 
mass falls a distance L/2. Therefore, 


v = y/gL. 


(4.191) 


This is the same as the speed obtained by an object that falls a distance L/2. 
Note that if the initial piece hanging down through the hole is arbitrarily short, 
then the rope will take an arbitrarily long time to fall down. But the final speed 
will be still be (arbitrarily close to) yfgL. 


Second Solution: Let x be the length that hangs down through the hole. The 
gravitational force on this length, which is (ax)g, is responsible for changing the 
momentum of the entire rope, which is ( aL)x. Therefore, F = dp/dt gives 
(ax)g = ( aL)x , which is simply the F = rna equation. Hence, x = ( g/L)x , and 
the general solution to this equation is 

x(t) = Ae t 'f°/ Z + (4.192) 

Note that if e is the initial value for x, then A = B = e/2 satisfies the initial con¬ 
ditions x(0) = e and ±(0) = 0, in which case we may write x(t) = ecosh {ty/g/L). 
But we won’t need this information in what follows. 

22 If you instead wanted to use the entire rope as your system, then eq. (4.188) would still look 
the same, because the net force is the same (the extra weight of the rope on the floor is cancelled 
by normal force from the floor), and the momentum is the same (only the moving part has nonzero 
P)- 
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Let T be the time for which x(T') = L. If e is very small, then T will be 
very large. But for large t, 2i we may neglect the negative-exponent term in eq. 
(4.192). We then have 

xK,Ael'f^ J ==> x « Ae^^jL « x^/JJl (for large t). 

(4.193) 

When x = L, we obtain 

x{T) = L^jL= s ^L, (4.194) 

in agreement with the first solution. 

(b) Let a be the mass density of the rope, and let x be the length that hangs down 
through the hole. The gravitational force on this length, which is ( ax)g , is 
responsible for changing the momentum of the rope. This momentum is (ax)x, 
because only the hanging part is moving. Therefore, F = dp/dt gives 

xg = xx + x 2 . (4.195) 

Note that F = ma gives the wrong equation, because it neglects the fact that 
the moving mass, ax, is changing. It therefore misses the second term on the 
right-hand side of eq. (4.195). In short, the momentum of the rope increases 
because it is speeding up (which gives the xx term) and because additional mass 
is continually being added to the moving part (which gives the x 2 term, as you 
can show). 

To solve eq. (4.195) for x(t), note that g is the only parameter in the equation. 
Therefore, the solution for x(t) can involve only c/’s and f’s. 24 By dimensional 
analysis, x(t) must then be of the form x(t') = bgt 2 , where b is a numerical 
constant to be determined. Plugging this expression for x(t) into eq. (4.195) 
and dividing by g 2 t 2 gives b = 2 b 2 + 4b 2 . Therefore, b = 1/6, and our solution 
may be written as 

x (t) = 2 (|) t 2 . (4.196) 

This is the equation for something that accelerates downward with acceleration 
g’ = g/ 3. The time the rope takes to fall a distance L is then given by L = g't 2 /2, 
which yields t = y/2L/g'. The final speed in thus 

v = g't=^/2Lg' = S j^. (4.197) 

This is smaller than the \fgL result from part (a). We therefore see that although 
the total time for the scenario in part (a) is very large, the final speed in that 
case is in fact larger than that in the present scenario. 

Remarks: Using eq. (4.197), you can show that 1/3 of the available potential energy 
is lost to heat. This inevitable loss occurs during the abrupt motions that suddenly 
bring the atoms from zero to non-zero speed when they join the moving part of the 

23 More precisely, for t \JL/g. 

24 The other dimensionful quantities in the problem, L and a, do not appear in eq. (4.195), so 
they cannot appear in the solution. Also, the initial position and speed (which will in general 
appear in the solution for x(t), because eq. (4.195) is a second-order differential equation) do not 
appear in this case, because they are equal to zero. 
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rope. The use of conservation of energy is therefore not a valid way to solve this 
problem. 

You can show that the speed in part (a)’s scenario is smaller than the speed in part 
(b)’s scenario for x less than 2L/3, but larger for x greater than 2L/3. 

28. The raindrop 

Let p be the mass density of the raindrop, and let A be the average mass density in 
space of the water droplets. Let r(t), M(t ), and v{t) be the radius, mass, and speed 
of the raindrop, respectively. 

We need three equations to solve for the above three unknowns. The equations we 
will use are two different expressions for dM/dt, and the F = dp/dt expression for 
the raindrop. 

The first expression for M is obtained by simply taking the derivative of M = 
(4/3)7t r 3 p, which gives 

M = 4nr 2 rp (4.198) 

= 3 M-. (4.199) 


The second expression for M is obtained by noting that the change in M is due to 
the acquisition of water droplets. The raindrop sweeps out volume at a rate given by 
its cross-sectional area times its velocity. Therefore, 

M = nr 2 vX. (4.200) 


The F = dp/dt equation is found as follows. The gravitational force is Mg, and the 
momentum is Mv. Therefore, F = dp/dt gives 

Mg = Mv + Mv. (4.201) 


We now have three equations involving the three unknowns, r, M, and v. 25 

Our goal is to find v. We will do this by first finding r. Eqs. (4.198) and (4.200) give 


4 P . 

A 

4 P_~ 

A ' 

Plugging eqs. (4.199, 4.202, 4.203) into eq. (4.201) gives 



(4.202) 

(4.203) 


(4.204) 


Therefore, 


gr = 12f 2 + 4rr, (4.205) 

where we have defined g = gX/p, for convenience. The only parameter in eq. (4.205) 
is g. Therefore, r(t) can depend only on g and t. Hence, by dimensional analysis, r 
must take the form 


r = Agt 2 , 


(4.206) 


25 Note that we cannot write down the naive conservation-of-energy equation (which would say 
that the decrease in the water’s potential energy equals the increase in its kinetic energy), because 
mechanical energy is not conserved. The collisions between the raindrop and the droplets are 
completely inelastic. The raindrop will, in fact, heat up. See the remark at the end of the solution. 
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where A is a numerical constant, to be determined. Plugging this expression for r 
into eq. (4.205) gives 

g(Agt 2 ) = \2{2A~gt) 2 +4{A~gt 2 ){2A~g) 

=> A = A8A 2 + 8A 2 . (4.207) 

Therefore, A = 1/56, and so r = 2Ag = g /28 = g\/28p. Eq. (4.203) then gives the 
acceleration of the raindrop as 

v = |, (4.208) 

independent of p and A. 


Remarks: A common invalid solution to this problem is the following, which (incorrectly) 
uses conservation of energy. 

The fact that v is proportional to r (shown in eq. (4.202)) means that the volume swept 
out by the raindrop is a cone. The center of mass of a cone is 1 /4 of the way from the base 
to the apex (as you can show by integrating over horizontal circular slices). Therefore, if M 
is the mass of the raindrop after it has fallen a height h, then an (incorrect) application of 
conservation of energy gives 

l -Mv 2 = Mg^ =** « 2 = f- (4-209) 

Taking the derivative of this (or equivalently, using the general result, v 2 = 2 ad), we obtain 


v = j (incorrect). 


(4.210) 


The reason why this solution is invalid is that the collisions between the raindrop and the 
droplets are completely inelastic. Heat is generated, and the overall kinetic energy of the 
raindrop is smaller than you would otherwise expect. 

Let’s calculate how much mechanical energy is lost (and therefore how much the raindrop 
heats up) as a function of the height fallen. The loss in mechanical energy is 

Ei OBt = Mg^ - \Mv 2 . (4.211) 

Using v 2 = 2(g/7)h, this becomes 

A£ int = E lost = ^ Mgh , (4.212) 

where AIAnt is the gain in internal thermal energy. 

The energy required to heat lg of water by 1 C° is 1 calorie (= 4.18 joules). Therefore, the 
energy required to heat 1 kg of water by 1 C° is « 4200 J. In other words, 

A E int = 4200 M AT, (4.213) 

where M is measured in kilograms, and T is measured in Celsius. Eqs. (4.212) and (4.213) 
give the increase in temperature as a function of h, 

4200 A T=^-gh. (4.214) 


How far must the raindrop fall before it starts to boil? If we assume that the water droplets’ 
temperature is near freezing, then the height through which the raindrop must fall to have 
AT = 100 C° is found from eq. (4.214) to be 

h w 400 km, (4.215) 

which is much larger than the height of the atmosphere. We have, of course, idealized the 
problem. But needless to say, there is no need to worry about getting burned by the rain. 
A typical value for h is a few kilometers, which would raise the temperature by only about 
one degree. This effect, of course, is washed out by many other factors. 
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Chapter 5 

The Lagrangian Method 

Copyright 2004 by David Morin, morin@physics.harvard.edu 

Consider the problem of a mass on the end of a spring. We can solve this, of course, 
by using F = ma to write down mx = —kx. The solutions to this equation are 
sinusoidal functions, as we well know. We can, however, solve this problem by using 
another method which doesn’t explicitly use F = ma. In many (in fact, probably 
most) physical situations, this new method is far superior to using F = ma. You 
will soon discover this for yourself when you tackle the problems for this chapter. 

We will present our new method by first stating its rules (without any justifica¬ 
tion) and showing that they somehow end up magically giving the correct answer. 
We will then give the method proper justification. 


5.1 The Euler-Lagrange equations 


Here is the procedure. Form the following seemingly silly combination of the kinetic 
and potential energies (T and V, respectively), 


L = T-V\ . (5.1) 


This is called the Lagrangian. Yes, there is a minus sign in the definition (a plus 
sign would simply give the total energy). In the problem of a mass on the end of a 
spring, T = mx 2 /2 and V = kx 2 /2, so we have 


L = 


2 mx 


2 


(5.2) 


Now write 


1 1 

(0L\ 

_ dL 

dt 

\dxj 

dx 


(5.3) 


Don’t worry, we’ll show you in Section 5.2 where this comes from. This equation 
is called the Euler-Lagrange (E-L) equation. For the problem at hand, we have 
dL/dx = mx and dL/dx = —kx, so eq. (5.3) gives 


(5.4) 
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which is exactly the result obtained by using F = ma. An equation such as eq. 
(5.4), which is derived from the Euler-Lagrange equation, is called an equation of 
motion. 1 

If the problem involves more than one coordinate, as most problems do, we sim¬ 
ply have to apply eq. (5.3) to each coordinate. We will obtain as many equations as 
there are coordinates. Each equation may very well involve many of the coordinates 
(see the example below, where both equations involve both x and 0). 

At this point, you may be thinking, “That was a nice little trick, but we just 
got lucky in the spring problem. The procedure won’t work in a more general 
situation.” Well, let’s see. How about if we consider the more general problem of 
a particle moving in an arbitrary potential, V(x ) (we’ll just stick to one dimension 
for now). Then the Lagrangian is 

L=^mx 2 — V(x). (5.5) 

The Euler-Lagrange equation, eq. (5.3), gives 

= (5.6) 

dx 

But —dV/dx is simply the force on the particle. So we see that eqs. (5.1) and 
(5.3) together say exactly the same thing that F = ma says, when using a cartesian 
coordinate in one dimension (but this result is in fact quite general, as we will see 
in Section 5.4). 

Note that shifting the potential by a given constant has no effect on the equation 
of motion, because eq. (5.3) involves only derivatives of V. This, of course, is 
equivalent to saying that only differences in energy are relevant, and not the actual 
values, as we well know. 

In a three-dimensional problem, where the potential takes the form V(x,y,z), it 
immediately follows that the three Euler-Lagrange equations (obtained by applying 
eq. (5.3) to x, y, and z) may be combined into the vector statement, 

mx = — VE. (5.7) 

But —VE = F, so we again arrive at Newton’s second law, F = ma, now in three 
dimensions. 

Let’s now do one more example to convince you that there’s really something 
nontrivial going on here. 


Example (Spring pendulum): Consider a pendulum made out of a spring with 
a mass m on the end (see Fig. 5.1). The spring is arranged to lie in a straight line 
(which we can arrange by, say, wrapping the spring around a rigid massless rod). The 

1 The term “equation of motion” is a little ambiguous. It is understood to refer to the second- 
order differential equation satisfied by x, and not the actual equation for a; as a function of t, namely 
x(t) = A cos(<vf+ </>) in this problem, which is obtained by integrating the equation of motion twice. 
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equilibrium length of the spring is t. Let the spring have length £ + x(t), and let its 
angle with the vertical be 6(t). Assuming that the motion takes place in a vertical 
plane, find the equations of motions for x and 9. 

Solution: The kinetic energy may be broken up into the radial and tangential parts, 
so we have 

T = ^m(x 2 + {t + a;) 2 # 2 ). (5.8) 

The potential energy comes from both gravity and the spring, so we have 

V (x, 9) =—mg(£ + x) cos 9+^kx 2 . (5.9) 

The Lagrangian therefore equals 

L = T — V = ^ m(x 2 + (£ + x) 2 9 2 ^ + mg(£ + x) cos 6 — ^fcr 2 . (5.10) 

There are two variables here, x and 6. As mentioned above, the nice thing about the 
Lagrangian method is that we can simply use eq. (5.3) twice, once with x and once 
with 9. Hence, the two Euler-Lagrange equations are 

s(S) = S =* nu= = m{l + x)P + m 9 co S l)-kT, (5.11) 

and 

=> m{l + x) 2 9 + 2 m(£ + x)x9 = —mg(l + x) sin 9. 

=> m(l + x)9 + 2mx9 = — mgsm9. (5-12) 

Eq. (5.11) is simply the radial F = ma equation, complete with the centripetal 
acceleration, —(£ + x)9 2 . The first line of eq. (5.12) is the statement that the torque 
equals the rate of change of the angular momentum (one of the subjects of Chapter 
7) 2 

After writing down the E-L equations, it is always best to double-check them by 
trying to identify them as F = ma or r = d,L/dt equations. Sometimes, however, this 
identification is not obvious. For the times when everything is clear (that is, when 
you look at the E-L equations and say, “Oh, of course!”), it is usually clear only after 
you’ve derived them. The Lagrangian method is generally the safer method to use. 
The present example should convince you of the great utility of the Lagrangian 
method. Even if you’ve never heard of the terms “torque”, “centripetal”, “centrifu¬ 
gal” , or “Coriolis”, you can still get the correct equations by simply writing down the 
kinetic and potential energies, and then taking a few derivatives. 


2 Alternatively, if you want to work in a rotating frame, then eq. (5.11) is the radial F = ma 
equation, complete with the centrifugal force, m(l + x)9 2 . And the third line of eq. (5.12) is the 
tangential 'F'= ma equation, complete with the Coriolis force, —2 mx6. But never mind about this 
now. We’ll deal with rotating frames in Chapter 9. 
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At this point it seems to be personal preference, and all academic, whether you 
use the Lagrangian method or the F = ma method. The two methods produce the 
same equations. However, in problems involving more than one variable, it usually 
turns out to be much easier to write down T and V, as opposed to writing down 
all the forces. This is because T and V are nice and simple scalars. The forces, on 
the other hand, are vectors, and it’s easy to get confused if they point in various 
directions. The Lagrangian method has the advantage that once you’ve written 
down L = T — V , you don’t have to think anymore. All you have to do is blindly 
take some derivatives. 3 

When jumping from high in a tree, 

Just write down del L by del z- 
Take del L by z dot, 

Then f-dot what you’ve got, 

And equate the results (but quickly!) 

But ease and speed of computation aside, is there any fundamental difference be¬ 
tween the two methods? Is there any deep reasoning behind eq. (5.3)? Indeed, 
there is... 


5.2 The principle of stationary action 

Consider the quantity, 

ft 2 

S = J L(x,x,t)dt. (5.13) 

S is called the action. It is a number with the dimensions of (Energy) x (Time). S 
depends on L. and L in turn depends on the function x(t) via eq. (5.1). 4 Given any 
function x(t), we can produce the number S. We’ll just deal with one coordinate, 
x, for now. 

S is called a functional, and is sometimes denoted by S[x'(f)]. It depends on the 
entire function x(t), and not on just one input number, as a regular function /(f) 
does. S can be thought of as a function of an infinite number of values, namely 
all the x{t) for t ranging from ti to t- 2 - If you don’t like infinities, you can imagine 
breaking up the time interval into, say, a million pieces, and then replacing the 
integral by a discrete sum. 

Let us now pose the following question: Consider a function x(t), for 1 1 < f < t2, 
which has its endpoints fixed (that is, x(ti) = x\ and x(t 2 ) = X2, where x\ and x'2 
are given), but is otherwise arbitrary. What function x(t) yields a stationary value 
of S? A stationary value is a local minimum, maximum, or saddle point. 5 

3 Of course, you eventually have to solve the resulting equations of motion, but you have to do 
that when using the F = ma method, too. 

4 In some situations, the kinetic and potential energies in L = T — V may explicitly depend on 
time, so we have included the “i” in eq. (5.13). 

S A saddle point is a point where there are no first-order changes in S, and where some of the 
second-order changes are positive and some are negative (like the middle of a saddle, of course). 
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For example, consider a ball dropped from rest, and consider the function y(t ) 
for 0 < t < 1. Assume that we somehow know that t/(0) = 0 and y( 1) = —g/ 2. 6 
A number of possibilities for y(t) are shown in Fig. 5.2, and each of these can (in 
theory) be plugged into eqs. (5.1) and (5.13) to generate S. Which one yields a 
stationary value of SI The following theorem gives us the answer. 



Theorem 5.1 If the function xq( t) yields a stationary value (that is, a local mini¬ 
mum, maximum, or saddle point) ofS, then 


Figure 


d / dL \ _ dL 
dt \dxoJ dxo 


(5.14) 


It is understood that we are considering the class of functions whose endpoints are 
fixed. That is, x(t\) = x\ and xfo) = X 2 - 


Proof: We will use the fact that if a certain function xq( t) yields a stationary 

value of S, then any other function very close to xo(t) (with the same endpoint 
values) yields essentially the same S, up to first order in any deviations. This is 
actually the definition of a stationary value. The analogy with regular functions is 
that if f(b) is a stationary value of /, then f(b + e) differs from f(b) only at second 
order in the small quantity e. This is true because f'(b) = 0, so there is no first-order 
term in the Taylor series around b. 

Assume that the function xq (i) yields a stationary value of S, and consider the 
function 

x a (t) = xo{t) + a/3(t), (5.15) 

where /3(t ) satisfies (3{t\) = ^{tf) = 0 (to keep the endpoints of the function fixed), 
but is otherwise arbitrary. 

The action S'[x a (f)] is a function of a (the t is integrated out, so S is just a 
number, and it depends on a), and we demand that there be no change in S at first 
order in a. How does S depend on a? Using the chain rule, we have 


i SMt)] = lC Ldt 


* f 

Jt i 

- r 

Jt i 


/ dL dx a dL dx a \ , 

V<Tc a da dx a da ) 


(5.16) 


In other words, a influences S through its effect on x, and also through its effect on 
x. From eq. (5.15), we have 


dx a 

da 


Pi 


and 


dx a 

da 


A 


(5.17) 


’This follows from y = —gt 2 /2, but pretend that we don’t know this formula. 
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so eq. (5.16) becomes 7 


d 

da 


= r 

jt i 




(5.18) 


Now comes the one sneaky part of the proof. You will see this trick many times in 
your physics career. We will integrate the second term by parts. Using 


f dL • , dL n f ( d dL\ n , 


eq. (5.18) becomes 
d 


d . / m f t2 ( dL d dL\ a , 8L n 

= 1 (^j t wJ 0dt+ aif 


(5.19) 


(5.20) 


But (3(t{) = [3(0) = 0, so the last term (the “boundary term”) vanishes. We now 
use the fact that (d/da)S[x a (t)] must be zero for any function pit), because we are 
assuming that xq (t) yields a stationary value. The only way this can be true is if 
the quantity in parentheses above (evaluated at a = 0) is identically equal to zero, 
that is, 


d / dL \ _ dL 
dt \dxoJ dx o 


(5.21) 


The E-L equation, eq. (5.3), therefore doesn’t just come out of the blue. It is a 
consequence of requiring that the action be at a stationary value. We may therefore 
replace F = ma by the following principle. 


• The Principle of Stationary-Action: 

The path of a particle is the one that yields a stationary value of the action. 


This principle is equivalent to F = ma because the above theorem shows that if 
(and only if, as you can show by working backwards) we have a stationary value of 
S, then the E-L equations hold. And the E-L equations are equivalent to F = ma 
(as we showed for Cartesian coordinates in Section 5.1 and which we’ll prove for 
any coordinate system in Section 5.4). Therefore, “stationary-action” is equivalent 
to F = ma. 

If we have a multidimensional problem, where the lagrangian is a function of 
the variables x\(t),X 2 (t ),..., then the above principle of stationary action is still all 
we need. With more than one variable, we can now vary the path by varying each 
coordinate (or combinations thereof). The variation of each coordinate produces an 
E-L equation which, as we caw in the cartesian case, is equivalent to an F = ma 
equation. 

Given a classical mechanics problem, we can solve it with F = ma, or we can 
solve it with the E-L equations, which derive from the principle of stationary action 

7 Note that nowhere do we assume that x a and x a are independent variables. The partial 
derivatives in eq. (5.17) are very much related, in that one is the derivative of the other. The use 
of the chain rule in eq. (5.16) is still perfectly valid. 
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(often called the principle of “minimal action”, but see the third remark below). 
Either method will get the job done. But as mentioned at the end of Section 5.1, it 
is often easier to use the latter, because it avoids the use of force, and it’s easy to 
get confused if you have forces pointing in all sorts of complicated directions. 

It just stood there and did nothing, of course, 

A harmless and still wooden horse. 

But the minimal action 
Was just a distraction; 

The plan involved no use of force. 

Let’s now return to the example of a ball dropped from rest, mentioned above. 
The Lagrangian is L = T — V = my 2 / 2 — mgy, so eq. (5.21) gives y = —g, which is 
simply the F = ma equation, as expected. The solution is y(t) = —gt 2 /2 + v$t+yo, 
as we well know. But the initial conditions tell us that vo = yo = 0, so our solution is 
y(t) = —gt 2 /2. You are encouraged to verify explicitly that this y(t) yields an action 
that is stationary with respect to variations of the form, say, y(t) = —gt 2 /2+et(t—l), 
which also satisfies the endpoint conditions (this is the task of Exercise 3). There 
are, of course, an infinite number of other ways to vary y(t), but this specific result 
should help convince you of the general result of Theorem 5.1. 

Note that the stationarity implied by the Euler-Lagrange equation, eq. (5.21), 
is a local statement. It gives information only about nearby paths. It says nothing 
about the global nature of how the action depends on all possible paths. If we find 
that a solution to eq. (5.21) happens to produce a local minimum, there is no reason 
to conclude that it is a global minimum, although in many cases it turns out to be. 

Remarks: 

1. Theorem 5.1 is based on the assumption that the ending time, t%, of the motion is 
given. But how do we know this final time? Well, we don’t. In the example of a ball 
thrown upward, the total time to rise and fall back to your hand can be anything, 
depending on the ball’s initial speed. This initial speed will show up as an integration 
constant when solving the E-L Equations. The motion has to end sometime, and 
the principle of stationary action says that for whatever time this happens to be, the 
physical path has a stationary action. 

2. Theorem 5.1 shows that we can explain the E-L equations by the principle of sta¬ 
tionary action. This, however, simply shifts the burden of proof. We are now left 
with the task of justifying why we should want the action to have a stationary value. 
The good news is that there is a very solid reason for this. The bad news is that the 
reason involves quantum mechanics, so we won’t be able to discuss it properly here. 
Suffice it to say that a particle actually takes all possible paths in going from one 
place to another, and each path is associated with the complex number e lS / n (where 
Ti = 1.05 ■ 10 -34 Js is Planck’s constant). These complex numbers have absolute value 
1 and are called “phases”. It turns out that the phases from all possible paths must 
be added up to give the “amplitude” of going from one point to another. The absolute 
value of the amplitude must then be squared to obtain the probability. 8 

8 This is one of those remarks that is completely useless, because it is incomprehensible to those 
who haven’t seen the topic before, and trivial to those who have. My apologies. But this and the 
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The basic point, then, is that at a non-stationary value of S, the phases from different 
paths differ (greatly, because h is so small) from one another, which effectively leads to 
the addition of many random vectors in the complex plane. These end up cancelling 
each other, yielding a sum of essentially zero. There is therefore no contribution to 
the overall amplitude from non-stationary values of S. Hence, we do not observe 
the paths associated with these S’s. At a stationary value of .S', however, all the 
phases take on essentially the same value, thereby adding constructively instead of 
destructively. There is therefore a non-zero probability for the particle to take a path 
that yields a stationary value of S. So this is the path we observe. 

3. But again, the preceding remark simply shifts the burden of proof one step further. 
We must now justify why these phases e' s / h should exist, and why the Lagrangian 
that appears in S should equal T — V. But here’s where we’re going to stop. 

4. The principle of stationary action is sometimes referred to as the principle of “least” 
action, but this is misleading. True, it is often the case that the stationary value 
turns out to be a minimum value, but it need not be, as we can see in the following 
example. 

Consider a harmonic oscillator. The Lagrangian is 

L = ^rni; 2 — ^ kx 2 . (5.22) 

Let Xq (£) be a function which yields a stationary value of the action. Then we know 
that xo(t) satisfies the E-L equation, mx o = —kx o- 

Consider a slight variation on this path, x 0 (t) + £(£), where £(£) satisfies £(£i) = 
£(£2) = 0. With this new function, the action becomes 

= j t (^(xo + ^ot + i 2 ) ~ ^(zo + 2a;o£ + £ 2 )) dt. (5.23) 

The two cross-terms add up to zero, because after integrating the xo£ term by parts, 
their sum is 

mx 0 ^ — j (mxo + kxo)^ dt. (5-24) 

The first term is zero, due to the boundary conditions on £(£). The second term is 
zero, due to the E-L equation. We’ve basically just reproduced the proof of Theorem 
5.1 for the special case of the harmonic oscillator here. 

The terms involving only xq give the stationary value of the action (call it So). To 
determine whether So is a minimum, maximum, or saddle point, we must look at the 
difference, 

AS = S £ - S 0 = 2 f 2 - k ?) dt. (5.25) 

It is always possible to find a function £ that makes AS positive. Simply choose £ to 
be small, but make it wiggle very fast, so that £ is large. Therefore, it is never the 
case that So is a maximum. Note that this reasoning works for any potential, not just 
a harmonic oscillator, as long as it is a function of only position (that is, it contains 
no derivatives, as we always assume). 

following remarks are by no means necessary for an understanding of the material in this chapter. 
If you’re interested in reading more about these quantum mechanics issues, you should take a look 
at Richard Feynman’s book, QED. Feynman was, after all, the one who thought of this idea. 
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You might be tempted to use the same line of reasoning to say that it is also always 
possible to find a function £ that makes AS negative, by making £ large and £ small. If 
this were true, then we could put everything together and conclude that all stationary 
points are saddle points, for a harmonic oscillator. However, it is not always possible to 
make £ large enough and £ small enough so that AS is negative, due to the boundary 
conditions £(ti) = CC) = 0. If £ changes from zero to a large value and then back to 
zero, then £ may also have to be large, if the time interval is short enough. Problem 
6 deals quantitatively with this issue. For now, let’s just say that in some cases So 
is a minimum, in some cases it is a saddle point, and it is never a maximum. “Least 
action” is therefore a misnomer. 

5. It is sometimes said that nature has a “purpose”, in that it seeks to take the path that 
produces the minimum action. In view of the second remark above, this is incorrect. 
In fact, nature does exactly the opposite. It takes every path, treating them all on 
equal footing. We simply end up seeing the path with a stationary action, due to the 
way the quantum mechanical phases add. 

It would be a harsh requirement, indeed, to demand that nature make a “global” 
decision (that is, to compare paths that are separated by large distances), and to 
choose the one with the smallest action. Instead, we see that everything takes place 
on a “local” scale. Nearby phases simply add, and everything works out automatically. 
When an archer shoots an arrow through the air, the aim is made possible by all the 
other arrows taking all the other nearby paths, each with essentially the same action. 
Likewise, when you walk down the street with a certain destination in mind, you’re 
not alone. 


When walking, I know that my aim 
Is caused by the ghosts with my name. 

And although I don’t see 
Where they walk next to me, 

I know they’re all there, just the same. 

6. Consider a function, f(x), of one variable (for ease of terminology). Let f(b) be a 
local minimum of /. There are two basic properties of this minimum. The first is 
that /(&) is smaller than all nearby values. The second is that the slope of / is zero 
at b. From the above remarks, we see that (as far as the action S is concerned) the 
first property is completely irrelevant, and the second one is the whole point. In other 
words, saddle points (and maxima, although we showed above that these never exist 
for S) are just as good as minima, as far as the constructive addition of the e tS A l 
phases is concerned. 

7. Given that classical mechanics is an approximate theory, while quantum mechanics 
is the (more) correct one, it is quite silly to justify the principle of stationary action 
by demonstrating its equivalence with F = ma, as we did above. We should be doing 
it the other way around. However, because our intuition is based on F = ma, we’ll 
assume that it’s easier to start with F = ma as the given fact, rather than calling 
upon the latent quantum-mechanical intuition hidden deep within all of us. Maybe 
someday... 

At any rate, in more advanced theories dealing with fundamental issues concerning 
the tiny building blocks of matter (where the action is of the same order of magnitude 
as K), the approximate F = too theory is invalid, and you have to use the Lagrangian 
method. 4 
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Figure 5.3 


V(r) 
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Figure 5.4 


5.3 Forces of constraint 

One nice thing about the Lagrangian method is that we are free to impose any 
given constraints at the beginning of the problem, thereby immediately reducing 
the number of variables. This is always done (perhaps without thinking) whenever 
a particle is constrained to move on a wire or surface, etc. Often we are not concerned 
with the exact nature of the forces doing the constraining, but only with the resulting 
motion, given that the constraints hold. By imposing the constraints at the outset, 
we can find the motion, but we can’t say anything about the constraining forces. 

If we want to determine the constraining forces, we must take a different ap¬ 
proach. The main idea of the strategy, as we will show below, is that we must not 
impose the constraints too soon. This, of course, leaves us with a larger number of 
variables to deal with, so the calculations are more cumbersome. But the benefit is 
that we are able to find the constraining forces. 

Consider the problem of a particle sliding off a fixed frictionless hemisphere 
of radius R (see Fig. 5.3). Let’s say that we are concerned only with finding 
the equation of motion for 9, and not the constraining force. Then we can write 
everything in terms of 9, because we know that the radial distance, r, is constrained 
to be R. The kinetic energy is mR 2 9 2 / 2, and the potential energy (relative to the 
bottom of the hemisphere) is mgr cos 9. The Lagrangian is therefore 

L = ^ mR 2 9 2 — mgR cos 6 , (5.26) 

and the equation of motion, via eq. (5.3), is 

9 = {g/R) sin0, (5.27) 

which is simply the tangential F = ma statement. 

Now let’s say that we want to find the constraining normal force that the hemi¬ 
sphere applies to the particle. To do this, let’s solve the problem in a different way 
and write things in terms of both r and 9. Also (and here’s the critical step), let’s 
be really picky and say that r isn’t exactly constrained to be R, because in the real 
world the particle actually sinks into the hemisphere a little bit. This may seem 
a bit silly, but it’s really the whole point. The particle pushes and sinks inward 
a tiny distance until the hemisphere gets squashed enough to push back with the 
appropriate force to keep the particle from sinking in any more. (Just consider the 
hemisphere to be made of lots of little springs with very large spring constants.) 
The particle is therefore subject to a (very) steep potential due to the hemisphere. 
The constraining potential, V(r), looks something like the plot in Fig. 5.4. The 
true Lagrangian for the system is thus 

L = ^m(r 2 + r 2 0 2 ) — mgr cos 9 — V(r). (5.28) 

(The r 2 term in the kinetic energy will turn out to be insignificant.) The equations 
of motion obtained from varying 9 and r are therefore 

mr 2 9 = mgr sin 9, 
mr = mr9 2 — mg cos 9 — V\r). 


(5.29) 
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Having written down the equations of motion, we will now apply the constraint 
condition that r = R. This condition implies r = r = 0. (Of course, r isn’t really 
equal to R, but any differences are inconsequential from this point onward.) The 
first of eqs. (5.29) then simply gives eq. (5.27), while the second yields 

— ^ I = mg cos 9 — mR9 2 . (5.30) 

dr \ r=R 

But F = —dV/dr is the constraint force applied in the r direction, which is precisely 
the force we are looking for. The normal force of constraint is therefore 

F(9,9) = mg cos 0 — mR9 2 , (5.31) 


which is simply the radial F = ma statement. Note that this result is valid only if 
F(9, 9) > 0. If the normal force becomes zero, then this means that the particle has 
left the sphere, in which case r is no longer equal to R (except at the instant right 
when it leaves). 

Remarks: 


1. What if we instead had (unwisely) chosen Cartesian coordinates, x and y, instead of 
polar coordinat es, r and 91 Since the distance from the particle to the surface of the 
sphere is rj = \/x 2 +■ y 2 — R, we obtain a true Lagrangian equal to 

L = ^m(x 2 + y 2 ) - mgy -V(rj). (5.32) 


The equations of motion are (using the chain rule) 


dVdq 
dy dx ’ 


dV dy 
= - m 9 -^dy- 


(5.33) 


We now apply the constraint condition rj = 0. Since —dV/dr] equals the constraint 
force F, you can show that the equations we end up with (namely, the two E-L 
equations and the constraint equation) are 


my= -mg + F^ 


(5.34) 


These three equations are sufficient to determine the three unknowns x, y, and F as 
functions of the quantities x, x, y, and y (see Exercise 9, which should convince you 
that polar coordinates are the way to go). 


2. You can see from eqs. 
form, 


(5.29) and (5.34) that the E-L equations end up taking the 


al /dL\ _ dL dr? 

dt \ dy,, J dqi + dqi ’ 


(5.35) 


for each coordinate, <?j. Here r? is the constraint equation of the form r? = 0. In our 
hemisphere problem, we have r? = r — R in polar coordinates, and r? = \Jx 1 + y' 2 — R 
in cartesian coordinates. The E-L equations, combined with the r? = 0 condition, 
give us exactly the number of equations (N + 1 of them, where N is the number of 
coordinates) needed to determine all of the N+l unknowns (the <?,: and F), in terms 
of the qi and q t . 
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3. When trying to determine the forces of constraint, you can simply start with eqs. 
(5.35), without bothering to write down V(r]). But you must be careful to make sure 
that r] does indeed represent the distance the particle is from where it should be. In 
polar coordinates, if someone gives you the constraint condition as 7(r — R) = 0, and 
if you use the left-hand side of this as the y in eq. (5.35), then you will get the wrong 
constraint force, o ff'ifaffll fa ctor of 7. Likewise, in cartesian coordinates, writing the 
constraint as y — s/R 2 — x 2 = 0 would give you the wrong force. 

The best way to avoid this problem is, of course, to pick one of your variables as the 
distance the particle is from where it should be (up to an additive constant, as in the 
case of the r in eq. (5.28)). Jh 


5.4 Change of coordinates 


When L is written in terms of cartesian coordinates x,y,z, we showed in Section 5.1 
that the Euler-Lagrange equations are equivalent to Newton’s F = ma equations; 
see eq. (5.7). But what about the case where we use polar, spherical, or some other 
coordinates? The equivalence of the E-L equations and F = ma is not so obvious. 
As far as trusting the E-L equations for such coordinates goes, you can achieve 
peace-of-mind in two ways. You can accept the principle of stationary action as 
something so beautiful and profound that it simply has to work for any choice of 
coordinates. Or, you can take the more mundane road and show through a change 
of coordinates that if the E-L equations hold for one set of coordinates, 9 then they 
also hold for any other coordinates (of a certain form, described below). In this 
section, we will demonstrate the validity of the E-L equations through the explicit 
change of coordinates. 10 

Consider the set of coordinates, 

Xi : (xi,X2, • ■ • ,x N ). (5.36) 


For example, xi,X 2 ,x-j could be the cartesian x.y.z coordinates of one particle, and 
xq.x's.x-fj could be the r ,#,</> polar coordinates of a second particle, and so on. Assume 
that the E-L equations hold for these variables, that is, 


d / dL \ _ dL 
dt \dxi) dxi ’ 


(1 < i < N). 


(5.37) 


We know that there is at least one set of variables for which this is true, namely the 
cartesian coordinates. Consider a new set of variables which are functions of the x % 
and t, 

Qi = qi(xi,X 2 , ■ ■ ■ ,x N ]t). (5.38) 

We will restrict ourselves to the case where the qi do not depend on the x*. (This 
is quite reasonable. If the coordinates depended on the velocities, then we wouldn’t 
be able to label points in space with definite coordinates. We’d have to worry about 

9 We know that they do hold for cartesian coordinates, because we showed in this case that the 
E-L equations are equivalent to F = ma, and we are assuming F = ma to be true. 

10 This calculation is straightforward but a bit messy, so you may want to skip this section and 
just settle for the “beautiful and profound” reasoning. 
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how the particles were behaving when they were at the points. These would be 
strange coordinates indeed.) Note that we can, in theory, invert eqs. (5.38) and 
express the Xi as functions of the qi and t, 

Xi ~ Xj(q\. q - 2: .... q N : /.). (5.39) 


Claim 5.2 If eq. (5.37) is true for the Xi coordinates, and if the Xi and qi are 
related by eqs. (5.39), then eq. (5.37) is also true for the qi coordinates. That is, 


d ( dL \ _ dL 
dt \dq m ) dq rn ’ 


(1 <m<N ). 


(5.40) 


Proof: We have 


dL _^dL dij 
dq m d±i dq m ' 


(5.41) 


(Note that if the x\ depended on the q % , then we would have the additional term, 
Y J (dL/dxi)(dxi/dq rn ), but we have excluded such dependence.) Let’s rewrite the 
dxi/dq m term. From eq. (5.39), we have 


dxi . dx. 

, = t 1 N, qm + 


dt 


(5.42) 


Therefore, 


dii dxi 
dq m dq m ' 


(5.43) 


Substituting this into eq. (5.41) and taking the time derivative of both sides gives 


d / dL \ _ ^ d / dL\ dxi ^ dL d / dxi \ 
dt \dq m ) " dt \dxi) dq m " d±i dt \dq m ) 


(5.44) 


In the second term here, it is legal to switch the order of the total derivative, d/dt, 
and the partial derivative, d/dq m . 

Remark: In case you have your doubts, let’s prove that this switching is legal. 

d f dxi \ 
dt \dq m ) 


as was to be shown. £ 

In the first term on the right-hand side of eq. (5.44), we can use the given 
information in eq. (5.37) and rewrite the ( d/dt)(dL/dxi ) term. Eq. (5.44) then 


d ( dxi \ . d 
^-'d(((.\d^) qk+ di 


dxi ' 
dt \dq m/ 


o I dx t . dxi 
dqff 1 ^ + ~dt 

dx, 
dq r , 


(5.45) 
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becomes 

d 

dt 


as we wanted to show. 

We have therefore demonstrated that if the Euler-Lagrange equations are true 
for one set of coordinates, x% (and they are true for cartesian coordinates), then they 
are also true for any other set of coordinates, satisfying eq. (5.38). For those 
of you who look at the principle of stationary action with distrust (thinking that 
it might be a coordinate-dependent statement), this proof should put you at ease. 
The Euler-Lagrange equations are valid in any coordinates. 

Note that the above proof did not in any way use the precise form of the La- 
grangian. If L were equal to T+V, or 7 T + ttV 2 /T, or any other arbitrary function, 
our result would still be true: If eqs. (5.37) are true for one set of coordinates, then 
they are also true for any other coordinates q % satisfying eqs. (5.38). The point is 
that the only L for which the hypothesis is true at all (that is, for which eq. (5.37) 
holds) is L = T — V (or any constant multiple of this). 

Remark: On one hand, it is quite amazing how little we assumed in proving the above 
claim. Any new coordinates of the very general form (5.38) satisfy the E-L equations, as 
long as the original coordinates do. If the E-L equations had, say, a factor of 5 on the 
right-hand side of eq. (5.37), then they would not hold in arbitrary coordinates. To see this, 
just follow the proof through with the factor of 5. 

On the other hand, the claim is quite believable, if you make an analogy with a function 
instead of a functional. Consider the function f(z ) = z' 2 . This has a minimum at z = 0, 
consistent with the fact that df/dz = 0 at z = 0. But let’s now write / in terms of the 
variable y defined by, say, z = y 4 . Then f(y) = y 8 , and / has a minimum at y = 0, consistent 
with the fact that df/dy equals zero at y = 0. So /' = 0 holds in both coordinates at the 
corresponding points y = z = 0. This is the (simplified) analog of the E-L equations holding 
in both coordinates. In both cases, the derivative equation describes where the stationary 
value occurs. 

This change-of-variables result may be stated in a more geometrical (and friendly) way. 
If you plot a function and then stretch the horizontal axis in an arbitrary manner (which 
is what happens when you change coordinates), then a stationary value (that is, one where 
the slope is zero) will still be a stationary value after the stretching. A picture is worth a 
dozen equations, it appears. 

As an example of an equation that does not hold for all coordinates, consider the pre¬ 
ceding example, but with /' = 1 instead of f = 0. In terms of z, f = 1 when z = 1/2. 
And in terms of y, f = 1 when y = (1/8) 1 / 7 . But the points 2 = 1/2 and y = (1/8) 1 / 7 
are not the same point. In other words, /' = 1 is not a coordinate-independent statement. 
Most equations, of course, are coordinate dependent. The special thing about /' = 0 is that 
a stationary point is a stationary point no matter how you look at it. 11 £ 

11 There is, however, one exception. A stationary point in one coordinate system might be located 
at a kink in another coordinate system, so that /' is not defined there. For example, if we had 


(£) 


A dL dxi A dL dii 
—' dxi dq rn “ dxi dq m 
dL 
dq m ’ 


(5.46) 
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5.5 Conservation Laws 


5.5.1 Cyclic coordinates 


Consider the case where the Lagrangian does not depend 
q k . Then 

d / dL \ _ dL _ 
dt \dq k ) dq k 


Therefore 


8L_ 

dq k 


a certain coordinate 

(5.47) 

(5.48) 


where C is a constant, independent of time. In this case, we say that q k is a cyclic 
coordinate, and that dL/dq k is a conserved quantity (meaning that it doesn’t change 
with time). 

If cartesian coordinates are used, then dL/dx k is simply the momentum, rnx k . 
because x k appears in only the mx\/ 2 term (we exclude cases where V depends 
on x k ). We therefore call dL/dq k the generalized momentum corresponding to the 
coordinate q k . And in cases where dL/dq k does not change with time, we call it a 
conserved momentum. Note that a generalized momentum need not have the units 
of linear momentum, as the angular-momentum examples below show. 


Example 1: Linear momentum 

Consider a ball thrown through the air. In the full three dimensions, the Lagrangian 
is 

L = ^m(x 2 + y 2 + z 2 ) - mgz. (5.49) 

There is no x or y dependence here, so both dL/dx = mx and dL/dy = my are 
constant, as we well know. The fancy way of saying this is that conservation of 
p x = mx arises from spatial translation invariance in the x-direction. The fact that 
the Lagrangian doesn’t depend on x means that it doesn’t matter if you throw the 
ball in one spot, or in another spot a mile down the road. The setup is independent 
of the x value. This independence leads to conservation of p x . 

Example 2: Angular and linear momentum in cylindrical coordinates 

Consider a potential that depends only on the distance to the 2 -axis. In cylindrical 
coordinates, the Lagrangian is 

L=^ m (r 2 + r 2 e 2 + z 2 )-V(r). (5.50) 

There is no 2 dependence here, so dL/dz = mz is constant. Also, there is no 9 
dependence, so dL/86 = mr 2 6 is constant. Since r9 is the speed in the tangential 
direction around the 2 -axis, we see that our conserved quantity, mr(r9), is the angular 
momentum around the 2 -axis. (We actually haven’t defined angular momentum yet; 
we’ll talk about it at great length in Chapters 6-8.) In the same manner as in the 

instead defined y by 2 = then f(y) = y 1 ^ 2 , which has an undefined slope at y = 0. Basically, 
we’ve stretched (or shrunk) the horizontal axis by a factor of infinity at the origin, and this is a 
process that can indeed change a zero slope into an undefined one. But let’s not worry about this. 
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preceding example, conservation of angular momentum around the 2 -axis arises from 
rotation invariance around the 2 -axis. 

Example 3: Angular momentum in spherical coordinates 

In spherical coordinates, consider a potential that depends only on r and 9. (Our 
convention for spherical coordinates will be that 9 is the angle down from the north 
pole, and <f> is the angle around the equator.) The Lagrangian is 

L = X -m(r 2 + r 2 9 2 + r 2 sin 2 9<fi 2 ) - V(r, 9). (5.51) 

There is no <j> dependence here, so dL/d<j) = rrir 2 sin 2 9<j> is constant. Since r sin 9 is 
the distance from the 2 -axis, and since r sin 9<f is the speed in the tangential direction 
around the 2 -axis, we see that our conserved quantity, m(r sin 9){r sin 9(f), is the 
angular momentum around the 2 -axis. 


5.5.2 Energy conservation 

We will now derive another conservation law, namely conservation of energy. The 
conservation of momentum or angular momentum above arose when the Lagrangian 
was independent of x, y, z , 6 , or (f>. Conservation of energy arises when the La¬ 
grangian is independent of time. This conservation law is different from those in the 
above momenta examples, because t is not a coordinate which the stationary-action 
principle can be applied to. You can imagine varying the coordinates x, 0, etc., 
which are functions of t. But it makes no sense to vary t. Therefore, we’re going to 
have to prove this conservation law in a different way. 

Consider the quantity 

'“(SS*)" 4, (5 ' 52) 

E will (usually) turn out to be the energy. We’ll show this below. The motivation 
for this expression for E comes from the theory of Legendre transforms, but we 
won’t get into that here. Let’s just accept the definition in eq. (5.52), and now we’ll 
prove a nice little theorem about it. 

Claim 5.3 If L has no explicit time dependence (that is, if dL/dt = 0), then E is 
conserved (that is, dE/dt = 0 ), assuming the motion obeys the E-L equations (which 
it does). 

Note that there is one partial derivative and one total derivative in this statement. 

Proof: L is a function of the qi, the <ji , and possibly t. Making copious use of the 
chain rule, we have 


dE _ d_(^dL.\_dL 

dt dt dqi^J dt 
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There are five terms here. The second cancels with the fourth. And the first (after 
using the E-L equation, eq. (5.3), to rewrite it) cancels with the third. We therefore 
arrive at the simple result, 


dE _ _dL 
dt dt 


(5.54) 


In the event that dL/dt = 0 (that is, there are no t 's sitting on the paper when you 
write down L), which is invariably the case in the situations we consider (because 
we won’t consider potentials that depend on time), we have dE/dt = 0. ■ 


Not too many things are constant with respect to time, and the quantity E has 
units of energy, so it’s a good bet that it is the energy. Let’s show this in cartesian 
coordinates (however, see the remark below). The Lagrangian is 


L = 


m(x 2 + y 2 + z 


- V(x,y,z), 


(5.55) 


so eq. (5.52) gives 

E = ^m(x 2 + y 2 + z 2 )+ V(x,y,z), (5.56) 

which is, of course, the total energy. The effect of the operations in eq. (5.52) in 
most cases is to simply switch the sign in front of the potential. 

Of course, taking the kinetic energy T and subtracting the potential energy V 
to obtain L, and then using eq. (5.52) to produce E = T + V, seems like a rather 
convoluted way of arriving at T + V. But the point of all this is that we used the 
E-L equations to prove that E is conserved. Although we know very well from the 
F = ma methods in Chapter 4 that the sum T + V is conserved, it’s not fair to 
assume that it is conserved in our new Lagrangian formalism. We have to show that 
this follows from the E-L equations. 

As with the translation and rotation invariance we observed in the examples 
in Section 5.5.1, we now see that energy conservation arises from time translation 
invariance. If the Lagrangian has no explicit t dependence, then the setup looks the 
same today as it did yesterday. This fact leads to conservation of energy. 

Remark: The quantity E in eq. (5.52) gives the energy of the system only if the 
entire system is represented by the Lagrangian. That is, the Lagrangian must represent a 
closed system with no external forces. If the system is not closed, then Claim 5.3 (or more 
generally, eq. (5.54)) is still perfectly valid for the E defined in eq. (5.52), but this E may 
simply not be the energy of the system. Problem 8 is a good example of such a situation. 

Another example is projectile motion in the x-y plane. The normal thing to do is 
to say that the particle moves under the influence of the potential V(y) = mgy. The 
Lagrangian for this closed system is L = m(x 2 + y) 2 / 2 — mgy, and so eq. (5.52) gives 
E = m(x 2 + y) 2 /2 + mgy, which is indeed the energy of the particle. However, another way 
to do this problem is to consider the particle to be subject to an external gravitational force, 
which gives an acceleration of —g in the y direction. If we assume that the mass starts at 
rest, then y = —gt. The Lagrangian is therefore L = mi 2 / 2 + m{gt) 2 / 2, and so eq. (5.52) 
gives E = mi 2 / 2 — m(gt) 2 / 2. This is not the energy. 

At any rate, most of the systems we will deal with are closed, so you can usually ignore 
this remark and assume that the E in eq. (5.52) gives the energy. A 
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5.6 Noether’s Theorem 

We now present one of the most beautiful and useful theorems in physics. It deals 
with two fundamental concepts, namely symmetry and conserved quantities. The 
theorem (due to Emmy Noether) may be stated as follows. 

Theorem 5.4 (Noether’s Theorem) For each symmetry of the Lagrangian, there 
is a conserved quantity. 

By “symmetry”, we mean that if the coordinates are changed by some small quan¬ 
tities, then the Lagrangian has no first-order change in these quantities. By “con¬ 
served quantity”, we mean a quantity that does not change with time. The result 
in Section 5.5.1 for cyclic coordinates is a special case of this theorem. 

Proof: Let the Lagrangian be invariant (to first order in the small number e) 

under the change of coordinates, 


Qi —>Qi + eKi(q). 


(5.57) 


Each Kf q) may be a function of all the q % . which we collectively denote by the 
shorthand, q. 

Remark: As an example of what these Kf s might look like, consider the Lagrangian 
(which we just pulled out of a hat), L = (rn/2)(5i; 2 — 2 xy + 2 y 2 ) + C(2x — y). This is 
invariant under the transformation x —> x + e and y —> y + 2e, as you can easily check. (It 
is actually invariant to all orders in e, and not just first order. But this isn’t necessary for 
the theorem to hold.) Therefore, K x = 1 and K y = 2. In the problems we’ll be doing, the 
Kf s can generally be determined by simply looking at the potential term. 

Of course, someone else might come along with K x = 3 and K y = 6, which is also a 
symmetry. And indeed, any factor can be taken out of e and put into the Kf s without 
changing the quantity eKfq) in eq. (5.57). Any such modification will simply bring an 
overall constant factor (and hence not change the property of being conserved) into the 
conserved quantity in eq. (5.60) below. It is therefore irrelevant. 4k 

The fact that the Lagrangian does not change at first order in e means that 


de 


/ dL dqi dL dq t \ 

— de dqi de ) 



Using the E-L equation, eq. (5.3), we may rewrite this as 


0 = 



(5.58) 


(5.59) 
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Therefore, the quantity 

( 5 - 6 °) 

does not change with time. It is given the generic name of conserved momentum. 
But it need not have the units of linear momentum. ■ 

As Noether most keenly observed 

(And for which much acclaim is deserved), 

We can easily see, 

That for each symmetry, 

A quantity must be conserved. 


Example 1: Consider the Lagrangian in the above remark, L = (m/2) (5x 2 — 2 xy + 
2 y 2 ) + C(2x — y). We saw that K x = 1 and K y = 2. The conserved momentum is 
therefore 


P(x,y,x,y) = —K x +—K y = m(5x-y)(l) + m(-x+2y)(2) = m(3x+3y). (5.61) 
ox oy 

The overall factor of 3m doesn’t matter, of course. 

Example 2: Consider a thrown ball. We have L = (m/2) (x 2 + y 2 + z 2 ) — rrigz. This 
is invariant under translations in x, that is, x x + e: and also under translations in 
y, that is, y —* y + e. (Both x and y are cyclic coordinates.) Note that we only need 
invariance to first order in e for Noether’s theorem to hold, but this L is invariant to 
all orders. 

We therefore have two symmetries in our Lagrangian. The first has K x = 1, K y = 0, 
and K z = 0. The second has K x = 0, K y = 1, and K z = 0. Of course, the nonzero 
K/s here may be chosen to be any constant, but we may as well pick them to be 1. 
The two conserved momenta are 


Pi(x,y,z,x,y,z) 

dL TX dL TX dL TX 
dx Kx + dy Ky + dz Kz ~ mX ' 


P 2 (x,y,z,x,y,z) 

dL 8L r , dL „ 

= m Kx+ di Kv *d* Kz = my - 

(5.62) 


These are simply the x- and y-components of the linear momentum, as we saw in 
Example 1 in Section 5.5.1. 

Note that any combination of these momenta, say 3Pi + &P-2, is also conserved. (In 
other words, x —> x + 3e, y — > y + 8e, 2 —> z is a symmetry of the Lagrangian.) But 
the above Pi and P 2 are the simplest conserved momenta to choose as a “basis” for 
the infinite number of conserved momenta (which is how many you have, if there are 
two or more independent symmetries). 

Example 3: Consider a mass on a spring, with zero equilibrium length, in the x-y 
plane. The Lagrangian, L = (m/2)(x 2 + y 2 ) — (k/2)(x 2 + y 2 ), is invariant under the 
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change of coordinates, x —> x + ey, y —> y — ex, to first order in e (as you can check). 
So we have K x = y and K y = —x. The conserved momentum is therefore 

P(x,y,x,y) = ^ K x + ^ K y = m{xy-yx). (5.63) 

This is simply the (negative of the) ^-component of the angular momentum. The 
angular momentum is conserved here because the potential, V(x,y ) = x * 1 2 3 4 5 + y 2 = r 2 , 
depends only on the distance from the origin; we’ll discuss such potentials in Chapter 
6. 

In contrast with the first two examples above, the x —> x + ey, y —► y — ex transfor¬ 
mation isn’t so obvious here. How did we get this? Well, unfortunately there doesn’t 
seem to be any fail-proof method of determining the iQ’s in general, so sometimes 
you just have to guess around, as was the case here. But in many problems, the AV s 
are simple constants which are easy to see. 


Remarks: 

1. As we saw above, in some cases the Ki s are functions of the coordinates, and in some 
cases they are not. 

2. The cyclic-coordinate result in eq. (5.48) is a special case of Noether’s theorem, for 
the following reason. If L doesn’t depend on a certain coordinate qk, then q/. —» qk + e 
is certainly a symmetry. Hence Kk = 1 (with all the other K i ' > s equal to zero), and 
eq. (5.60) reduces to eq. (5.48). 

3. We use the word “symmetry” to describe the situation where the transformation in eq. 
(5.57) produces no first-order change in the Lagrangian. This is an appropriate choice 
of word, because the Lagrangian describes the system, and if the system essentially 
doesn’t change when the coordinates are changed, then we say that the system is 
symmetric. For example, if we have a setup that doesn’t depend on 9, then we say 
that the setup is symmetric under rotations. Rotate the system however you want, 
and it looks the same. The two most common applications of Noether’s theorem are 
the conservation of angular momentum, which arises from symmetry under rotations; 
and conservation of linear momentum, which arises from symmetry under translations. 

4. In simple systems, as in Example 2 above, it is clear why the resulting P is conserved. 
But in more complicated systems, as in Example 1 above (which has an L of the 
type that arises in Atwood’s machine problems; see Exercise 11 and Problem 9), the 
resulting P might not have an obvious interpretation. But at least you know that it 
is conserved, and this will invariably help in solving a problem. 

5. Although conserved quantities are extremely useful in studying a physical situation, 
it should be stressed that there is no more information contained in the them than 
there is in the E-L equations. Conserved quantities are simply the result of integrating 
the E-L equations. For example, if you write down the E-L equations for Example 1 
above, and then add the “x” equation (which is 5 mx — my = 2(7) to twice the “y” 
equation (which is —mx + 2 my = —C), then you find 3 m(x + ij) = 0. In other words, 
3 m(x + y) is constant, as we found from Noether’s theorem. 

Of course, you might have to do some guesswork to find the proper combination of 
the E-L equations that gives a zero on the right-hand side. But you’d have to do 
some guesswork anyway, to find the symmetry for Noether’s theorem. At any rate, a 
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conserved quantity is useful because it is an integrated form of the E-L equations. It 
puts you one step closer to solving the problem, compared to where you would be if 
you started with the second-order E-L equations. 

6. Does every system have a conserved momentum? Certainly not. The one-dimensional 
problem of a falling ball ( mz = —mg) doesn’t have one. And if you write down an 
arbitrary potential in 3-D, odds are that there won’t be one. In a sense, things have 
to contrive nicely for there to be a conserved momentum. In some problems, you can 
just look at the physical system and see what the symmetry is, but in others (for 
example, in the Atwood’s-machine problems for this chapter), the symmetry is not at 
all obvious. 

7. By “conserved quantity”, we mean a quantity that depends on (at most) the coor¬ 
dinates and their first derivatives (that is, not on their second derivatives). If we 
do not make this restriction, then it is trivial to construct quantities that do not 
vary with time. For example, in Example 1 above, the “x” E-L equation (which is 
5 mx — my = 2 C) tells us that 5 mx — my has its time derivative equal to zero. Note 
that an equivalent way of excluding these trivial cases is to say that the value of a 
conserved quantity depends on initial conditions (that is, velocities and positions). 
The quantity 5 mx — my does not satisfy this criterion, because its value is always 
constrained to be 2 C. £ 


5.7 Small oscillations 


In many physical systems, a particle undergoes small oscillations around an equilib¬ 
rium point. In Section 4.2, we showed that the frequency of these small oscillations 
is _ 

where V{x) is the potential energy, and xq is the equilibrium point. 

However, this result holds only for one-dimensional motion (we will see below 
why this is true). In more complicated systems, such as the one described below, it 
is necessary to use another procedure to obtain the frequency u. This procedure is 
a fail-proof one, applicable in all situations. It is, however, a bit more involved than 
simply writing down eq. (5.64). So in one-dimensional problems, eq. (5.64) is still 
what you want to use. 

We’ll demonstrate our fail-proof method through the following problem. 


Problem: A mass m is free to move on a frictionless table and is connected by a 

string, which passes through a hole in the table, to a mass M which hangs below (see 
Fig. 5.5). Assume that M moves in a vertical line only, and assume that the string 
always remains taut. 

(a) Find the equations of motion for the variables r and 6 shown in the figure. 

(b) Under what condition does m undergo circular motion? 

(c) What is the frequency of small oscillations (in the variable r) about this circular 
motion? 



M 

Figure 5.5 






V-22 


CHAPTER 5. THE LAGRANGIAN METHOD 


Solution: 


(a) Let the string have length l (this length won’t matter). Then the Lagrangian 
(we’ll call it here, to save “L” for the angular momentum, which arises 
below) is 

T = I Mr 2 + ^ro(f 2 + r 2 9 2 ) + Mg{£ - r). (5.65) 

For the purposes of the potential energy, we’ve taken the table to be at height 
zero, but any other value could be chosen, of course. The equations of motion 
obtained from varying 6 and r are 


dC 

(.M + m)f 


0 , 

mrQ 2 — Mg. 


(5.66) 


The first equation says that angular momentum is conserved (much more about 
this in Chapter 6 ). The second equation says that the Mg gravitational force 
accounts for the acceleration of the two masses along the direction of the string, 
plus the centripetal acceleration of m. 

(b) The first of eqs. (5.66) says that mr 2 6 = L, where L is some constant (the 
angular momentum) which depends on the initial conditions. Plugging 6 = 
L/mr 2 into the second of eqs. (5.66) gives 

L 2 

(M + m)r= — -—Mg. (5.67) 

mr 6 

Circular motion occurs when f = r 
orbit is given by 


= 0. Therefore, the radius of the circular 

’tLg- <5 ' 68) 


Remark: Note that since L = mr 2 0, eq. (5.68) is equivalent to 

mr 0 9 2 = Mg, (5.69) 

which can be obtained by simply letting r = 0 in the second of eqs. (5.66). In other 
words, the gravitational force on M exactly accounts for the centripetal acceleration 
of m if the motion is circular. Given ro, eq. (5.69) determines what 6 must be (in 
order to have circular motion), and vice versa. Jk 

(c) To find the frequency of small oscillations about the circular motion, we need 
to look at what happens to r if we perturb it slightly from its equilibrium value, 
ro- Our fail-proof procedure is the following. 

Let r(t) = ro + S(t), where S(t) is very small (more precisely, S(t) ro), and 
expand eq. (5.67) to first order in 6(t). Using 


1 1 _ 1 _ 1 1 / 3<J\ 

r 3 ~ (r 0 + 6) 3 ~ rl +.3rg5 rg(l + 3<tyr 0 ) ~ rg V r o) ’ 


(5.70) 


we obtain 2 

(M + m)6 « ^3 (l- —) -Mg. (5.71) 

mr o \ r o ) 

The terms not involving S on the right-hand side cancel, by the definition of ro 
given in eq. (5.68). This cancellation will always occur in such a problem at 
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this stage, due to the definition of the equilibrium point. We are therefore left 
with 2 

£ + ( , nr 3L . -xV«0. (5.72) 

This is a good old simple-harmonic-oscillator equation in the variable S. There¬ 
fore, the frequency of small oscillations about a circle of radius r 0 is 


3 h 2 

(M + m)rnrg 


(5.73) 


where we have used eq. (5.68) to eliminate L in the second expression. 

To sum up, the above frequency is the frequency of small oscillations in the 
variable r. In other words, if you plot r as a function of time (and ignore what 
9 is doing), then you will get a nice sinusoidal graph whose frequency is given 
by eq. (5.73), provided that that amplitude is small. Note that this frequency 
need not have anything to do with the other relevant frequency in this problem, 
namely the frequency of the circular motion, which is y/Mg/mro, from eq. 
(5.69). 


R emarks: Let’s look at some limits. For a given ro, if m > M, then u> « 

y/3Mg/mro « 0. This makes sense, because everything will be moving very slowly. 
Note that this frequency is equal to \/3 times the frequency of circular motion, Mg/mro, 
which isn’t at all obvious. 

For a given ro, if m -C M, then ui « y/3g/ro , which isn’t so obvious, either. 

Note that the frequency of small oscillations is equal to the frequency of circular motion 
if M = 2m (once again, not obvious). This condition is independent of ro. A 


The above procedure for finding the frequency of small oscillations may be 
summed up in three steps: (1) Find the equations of motion, (2) Find the equi¬ 
librium point, and (3) Let x(t ) = xq + 5(t), where xq is the equilibrium point of the 
relevant variable, and expand one of the equations of motion (or a combination of 
them) to first order in 5, to obtain a simple-harmonic-oscillator equation for 5. 

Remark: Note that if you simply used the potential energy in the above problem 
(which is Mgr, up to a constant) in eq. (5.64), then you would obtain a frequency of zero, 
which is incorrect. You can use eq. (5.64) to find the frequency, if you instead use the 
“effective potential” for this problem, namely L 2 /(2mr 2 ) + Mgr, and if you use the total 
mass, M + m, as the mass in eq. (5.64), as you can check. The reason why this works will 
become clear in Chapter 6 when we introduce the effective potential. 

In many problems, however, it is not obvious what “modified potential” should be used, 
or what mass should be used in eq. (5.64), so it is generally much safer to take a deep breath 
and go through an expansion similar to the one in part (c) above. 

The one-dimensional result in eq. (5.64) is, of course, simply a special case 
of our above expansion procedure. We can repeat the derivation of Section 4.2 
in the present language. In one dimension, we have mx = —V'(x). Let xq be 
the equilibrium point (so that V'(xq) = 0), and let x(t) = xq + 5(t). Expanding 
mx = — V'(x) to first order in <5, we have m6 = —V'(xq) — V"(xo)S — ■ ■ ■. Hence, 
m5 ~ —V"(xq)5, as desired. 
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5.8 Other applications 



Figure 5.6 



Figure 5.7 


The formalism developed in Section 5.2 works for any function L(x. x, t ). If our goal 
is to find the stationary points of S = f L, then eq. (5.14) holds, no matter what L 
is. There is no need for L to be equal to T — V, or indeed, to have anything to do 
with physics. And t need not have anything to do with time. All that is required 
is that the quantity x depend on the parameter t, and that L depend on only x, x, 
and t (and not, for example, on x; see Exercise 6 ). The formalism is very general 
and powerful, as the following example demonstrates. 


Example (Minimal surface of revolution): A surface of revolution has two given 
rings as its boundary; see Fig. 5.6. What should the shape of the surface be so that 
it has the minimum possible area? We’ll present two solutions. A third is left for 
Problem 23. 


First solution: Let the surface be generated by rotating the curve y = y(x) around 
the 2 :-axis. The boundary conditions are y(a i) = c\ and yia-i) = C 2 ; see Fig. 5.7. 
Slicing the surface up into vertical rings, we see that the area is given by 




(5.74) 


The goal is to find the function y(x) that minimizes this integral. We therefore have 
exactly the same situation as in Section 5.2, except that x is now the parameter 
(instead of t), and y in now the function (instead of x). Our “Lagrangian” is thus 
L oc j/ \/l + y' 2 . To minimize the integral A, we “simply” have to apply the E-L 
equation to this Lagrangian. This calculation, however, gets a bit tedious, so we’ve 
relegated it to Lemma 5.5 at the end of this section. For now we’ll just use the result 
in eq. (5.83) which gives (with f(y) = y here) 


1 + y' 2 = 1 


(5.75) 


At this point we can cleverly guess (motivated by the fact that 1 + sinh 2 z = cosh 2 z) 
that the solution is 

y(x) = - cosh 6 ( 2 ; + d), (5.76) 


where b = \[B, and d is a constant of integration. Or, we can separate variables to 
obtain (again with b = VB) 


and then use 
result. 


dx = , % , (5.77) 

V(by) 2 - 1 

the fact that the integral of 1 / \/z* — 1 is cosh -1 z, to obtain the same 


The answer to our problem, therefore, is that y(x) takes the form of eq. (5.76), with 
b and d determined by the boundary conditions, 


C 2 = j- cosh b(a 2 + d). 


(5.78) 


In the symmetrical case where C\ = C 2 , we know that the minimum occurs in the 
middle, so we may choose d = 0 and a\ = — 02 - 
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Remark: Solutions for b and d exist only for certain ranges of the a’s and c’s. Basically, if 
02 — 01 is too large, then there is no solution. In this case, the minimal “surface” turns out 
to be the two given circles, attached by a line (which isn’t a nice two-dimensional surface). 
If you perform an experiment with soap bubbles (which want to minimize their area), and if 
you pull the rings too far apart, then the surface will break and disappear as it tries to form 
the two circles. Problem 24 deals with this issue. A 


Second solution: Consider the curve that we rotate around the x-axis to be de¬ 
scribed now by the function x(y). That is, let x be a function of y. The area is then 
given by 




(5.79) 


where x' = dx/dy. Note that the function x{y) may be double-valued, so it may not 
really be a function. But it looks like a function locally, and all of our formalism deals 
with local variations. 

Our “Lagrangian” is now L oc y\/l + x' 2 , and the E-L equation is 


dy \ dx' J dx 


_d_ ( yx ' \ 

dy VVl+ar'v 


= 0 . 


(5.80) 


The nice thing about this solution is the “0” on the right-hand side, which arises from 
the fact that L does not depend on x (that is, x is a cyclic coordinate). Therefore, 
yx'/V 1 + x' 2 is constant. If we define this constant to be 1/6, then we may solve for 
x' and then separate variables to obtain 


dx = 


dy 


in agreement with eq. (5.77). The solution proceeds as above. 


(5.81) 


Numerous other “extremum” problems are solvable with these general tech¬ 
niques. A few are presented in the problems for this chapter. 

Let us now prove the following lemma, which we invoked in the first solution 
above. This lemma is very useful, because it is common to encounter problems 
where the quantity to be extremized depends on the arclength, y/1 + if 2 . and takes 
the form / f(y )\/1 + y /2 dx. 

We will give two proofs. The first proof uses the Euler-Lagrange equation. The 
calculation gets a bit messy, so it’s a good idea to work through it once and for all, 
and then just invoke the result whenever needed. The derivation isn’t something 
you’d want to repeat too often. The second proof makes use of a conserved quantity. 
And in contrast with the first proof, this method is exceedingly clean and simple. 
It actually is something you’d want to repeat quite often. But we’ll still do it once 
and for all. 

Lemma 5.5 Let f(y) be a given function ofy. Then the function y(x ) that extrem- 
izes the integral, 
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satisfies the differential equation, 

1 +y n = Bf{yf. i (5.83) 

where B is a constant of integration. 12 


First Proof: Our goal is to find the function y(x) that extremizes the integral in 
eq. (5.82). We therefore have exactly the same situation as in Section 5.2, except 
with x in place of t, and y in place of x. Our “Lagrangian” is thus L = f(y)y/\ + y /2 , 
and the Euler-Lagrange equation is 


d_ fdL\ _ a L 
dx \dy') dy 


A. 

dx 



(5.84) 


where f = df /dy. We must now perform some straightforward (albeit tedious) 
differentiations. Using the product rule on the three factors on the left-hand side, 
and making copious use of the chain rule, we obtain 



fy r2 y" 

(1 + y' 2)3/2 


(5.85) 


Multiplying through by (1 + y /2 ) 3 / 2 and simplifying gives 


fy” = /'(i + y' 2 )- 


(5.86) 


We have completed the first step of the proof, namely producing the Euler-Lagrange 
differential equation. We must now integrate it. Eq. (5.86) happens to be integrable 
for arbitrary functions f(y). If we multiply through by y' and rearrange, we obtain 


y'y" = /V 
1 + y' 2 f 


(5.87) 


Taking the dx integral of both sides gives (1/2) ln(l + y' 2 ) = ln(/) + C, where C is 
a constant of integration. Exponentiation then gives (with B = e 2C ) 


1 + y /2 = Bf(y) 2 , 


(5.88) 


as we wanted to show. In an actual problem, we would solve this equation for y', 
and then separate variables and integrate. But we would need to be given a specific 
function f(y) to be able do this. 


Second Proof: Note that our “Lagrangian”, L = f(y)\/l + y 12 , is independent of 
x. Therefore, in analogy with the conserved energy given in eq. (5.52), the quantity 

(5.89) 

dy v 1 + y' 2 

is independent of x. Call it 1 /y/B. Then we have easily reproduced eq. (5.88). ■ 


Important remark: As demonstrated by the brevity of this second proof, it is highly 
advantageous to make use of a conserved quantity (for example, the E here, which arose 
from independence of x ) whenever you can. X 

12 The constant B, and also one other constant of integration (arising when eq. (5.83) is integrated 
to solve for y), is determined by the boundary conditions on y(x). 
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5.9 Exercises 

Section 5.1: The Euler-Lagrange equations 

1. Three falling sticks *** 

Three massless sticks of length 2r, each with a mass m fixed at its middle, 
are hinged at their ends, as shown in Fig. 5.8. The bottom end of the lower 
stick is hinged at the ground. They are held such that the lower two sticks 
are vertical, and the upper one is tilted at a small angle e with respect to 
the vertical. They are then released. At this instant, what are the angular 
accelerations of the three sticks? Work in the approximation where e is very 
small. (You may want to look at Problem 3 first.) 

2. Coffee cup and mass *** 

A coffee cup of mass M is connected to a mass to by a string. The coffee cup 
hangs over a pulley (of negligible size), and the mass to is held horizontally, 
as shown in Fig. 5.9. The mass to is released. Find the equations of motion 
for r (the length of string between to and the pulley) and 9 (the angle that 
the string to to makes with the horizontal). Assume that to somehow doesn’t 
run into the string holding the cup up. 

The coffee cup will of course initially fall, but it turns out that it will reach a 
lowest point and then rise back up. Write a program (see Appendix D) that 
numerically determines the ratio of the r at this point to the r at the start, 
for a given value of m/M. (To check your program, a value of m/M = 1/10 
yields a ratio of about 0.208.) 


Section 5.2: The principle of stationary action 

3. Dropped ball * 

Consider that action, from t = 0 to t = 1, of a ball dropped from rest. 
From the E-L equation (or from F = ma), we know that y(t) = —gt 2 /2 
yields a stationary value of the action. Show explicitly that the function 
y(t ) = —gt 2 /2 + et(t — 1 ) yields an action that has no first-order dependence 
on e. 


4. Second-order change * 

Let x a (t) = x'o(t) + afl(t)- Eq. (5.16) gives the first derivative of the action 
with respect to a. Show that the second derivative is 


df 
da 2 


s[x a (t)] = r 

Jtl 


d 2 L, 


b 2 ,,. PP + ) dt. 

oxox ox z ' 


(5.90) 


5. Explicit minimization * 

A ball is thrown upward. Let y(t) be the height as a function of time, and 
assume y(0) = 0 and y(T) = 0. Guess a solution for y of the form y(t) = 
ao + ait + « 2 t 2 , and explicitly calculate the action between t = 0 and t = T. 
Show that the action is minimized when a 2 = —g/ 2. (This gets slightly messy.) 



Figure 5.8 


M {3 
Figure 5.9 
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4m 2m 


Figure 5.10 


6. x dependence ** 

Let there be x dependence (in addition to x,x,t dependence) in the Lagrangian 
in Theorem 5.1. There will then be the additional term ( dL/dx a )j3 in eq. 
(5.18). It is tempting to integrate this term by parts twice, and then arrive at 
a modified form of eq. (5.21): 


dL d t dL \ d 2 (dL\_ 

dxo dt Vdio/ dt 2 \<9alo/ 


(5.91) 


Is this a valid result? If not, where is the error in its derivation? 


Section 5.3: Forces of constraint 

7. Constraint on a circle 

A bead slides with speed v around a horizontal loop of radius R. What force 
does the loop apply to the bead? (Ignore gravity.) 

8. Constraint on a curve ** 

Let the horizontal plane be the x-y plane. A bead slides with speed v along 
a curve described by the function y = f(x). What force does the curve apply 
to the bead? (Ignore gravity.) 

9. Cartesian coordinates ** 

In eq. (5.34), take two derivatives of the \Jx 2 + y 2 — R = 0 equation to obtain 

R 2 (xx + yy) - (xy - yx) 2 = 0, (5.92) 

and then combine this with the other two equations to solve for F. Show that 
your result agrees with eq. (5.31). 


Section 5.5: Conservation Laws 

10. Bead on stick, using F = ma * 

After doing Problem 8 , show again that the quantity E is conserved, but now 
use F = ma. Do this is two ways: 

(a) Use the first of eqs. (2.52). Hint: multiply through by r. 

(b) Use the second of eqs. (2.52) to calculate the work done on the bead. 

11. Atwood’s machine ** 

Consider the Atwood’s machine shown in Fig. 5.10. The masses are 5m, 4m, 
and 2m. Let x and y be the heights of the left two masses, relative to their 
initial positions. Use Noether’s Theorem to find the conserved momentum. 
(The solution to Problem 9 gives some other methods, too.) 
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5.10 Problems 


Section 5.1: The Euler-Lagrange equations 

1. Moving plane ** 

A block of mass m is held motionless on a frictionless plane of mass M and 
angle of inclination 0 (see Fig. 5.11). The plane rests on a frictionless hori¬ 
zontal surface. The block is released. What is the horizontal acceleration of 
the plane? (This problem already showed up as Problem 2.2. If you haven’t 
already done so, try solving it using F = ma. You will then have a greater 
appreciation for the Lagrangian method.) 

2. Two masses, one swinging *** 

Two equal masses, m, connected by a string, hang over two pulleys (of negli¬ 
gible size), as shown in Fig. 5.12. The left one moves in a vertical line, but 
the right one is free to swing back and forth (in the plane of the masses and 
pulleys). Find the equations of motion for r and 0, as shown. 

Assume that the left mass starts at rest, and the right mass undergoes small 
oscillations with angular amplitude e (with e<Cl). What is the initial average 
acceleration (averaged over a few periods) of the left mass? In which direction 
does it move? 

3. Two falling sticks ** 

Two massless sticks of length 2 r, each with a mass m fixed at its middle, are 
hinged at an end. One stands on top of the other, as shown in Fig. 5.13. The 
bottom end of the lower stick is hinged at the ground. They are held such 
that the lower stick is vertical, and the upper one is tilted at a small angle e 
with respect to the vertical. They are then released. At this instant, what are 
the angular accelerations of the two sticks? Work in the approximation where 
e is very small. 



Figure 5.11 


Figure 5.12 



Figure 5.13 


4. Pendulum with an oscillating support ** 

A pendulum consists of a mass m and a massless stick of length t. The pendu¬ 
lum support oscillates horizontally with a position given by x(t ) = A cos(ojt) 
(see Fig. 5.14). Calculate the angle of the pendulum as a function of time. 

5. Inverted pendulum **** 

A pendulum consists of a mass m at the end of a massless stick of length i. 
The other end of the stick is made to oscillate vertically with a position given 
by y{t) = Acos(u;f), where A <C i. See Fig. 5.15. It turns out that if u is 
large enough, and if the pendulum is initially nearly upside-down, then it will 
surprisingly not fall over as time goes by. Instead, it will (sort of) oscillate 
back and forth around the vertical position. 



Figure 5.14 



Figure 5.15 
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Find the equation of motion for the angle of the pendulum (measured relative 
to its upside-down position). And explain why the pendulum doesn’t fall over, 
and find the frequency of the back and forth motion. 


Section 5.2: The principle of stationary action 

6. Minimum or saddle ** 


(a) In eq. (5.25), let t\ = 0 and £2 = T, for convenience. And let £(t) be an 
easy-to-deal-with “triangular” function, of the form 


j et/T, 0 <t< T/2, 

\ e(l ~t/T), T/2 < t < T. 


(5.93) 


Under what conditions is the harmonic-oscillator AS in eq. (5.25) nega¬ 
tive? 


(b) Answer the same question, but now with £(f) = esm(nt/T). 





ft 
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Figure 5.16 



Section 5.3: Forces of constraint 

7. Normal force from a plane ** 

A mass m slides down a frictionless plane that is inclined at angle 9. Show, 
using the method in Section 5.3, that the normal force from the plane is the 
familiar mg cos 9. 

Section 5.5: Conservation Laws 

8. Bead on a stick * 

A stick is pivoted at the origin and is arranged to swing around in a horizontal 
plane at constant angular speed u. A bead of mass m slides frictionlessly along 
the stick. Let r be the radial position of the bead. Find the conserved quantity 
E given in eq. (5.52). Explain why this quantity is not the energy of the bead. 

Section 5.6: Noether’s Theorem 

9. Atwood’s machine ** 

Consider the Atwood’s machine shown in Fig. 5.16. The masses are 4m, 3m, 
and m. Let x and y be the heights of the left and right masses, relative to 
their initial positions. Find the conserved momentum. 

Section 5.7: Small oscillations 

10. Hoop and pulley ** 

A mass M is attached to a massless hoop (of radius R) which lies in a vertical 
plane. The hoop is free to rotate about its fixed center. M is tied to a string 
which winds part way around the hoop, then rises vertically up and over a 
massless pulley. A mass m hangs on the other end of the string (see Fig. 5.17). 
Find the equation of motion for the angle of rotation of the hoop. What is 
the frequency of small oscillations? Assume that m moves only vertically, and 
assume M > m. 


Figure 5.17 
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11. Bead on a rotating hoop ** 

A bead is free to slide along a frictionless hoop of radius R. The hoop rotates 
with constant angular speed u around a vertical diameter (see Fig. 5.18). 
Find the equation of motion for the position of the bead. What are the 
equilibrium positions? What is the frequency of small oscillations about the 
stable equilibrium? 

There is one value of lo that is rather special. What is it, and why is it special? 

12. Another bead on a rotating hoop ** 

A bead is free to slide along a frictionless hoop of radius r. The plane of the 
hoop is horizontal, and the center of the hoop travels in a horizontal circle of 
radius R, with constant angular speed u, about a given point (see Fig. 5.19). 
Find the equation of motion for the position of the bead. Also, find the 
frequency of small oscillations about the equilibrium point. 

13. Rotating curve *** 

The curve y(x) = b(x/a) x is rotated around the y-axis with constant fre¬ 
quency u (see Fig. 5.20). A bead moves frictionlessly along the curve. Find 
the frequency of small oscillations about the equilibrium point. Under what 
conditions do oscillations exist? (This problem gets a little messy.) 

14. Mass on a wheel ** 

A mass m is fixed to a given point on the edge of a wheel of radius R. The 
wheel is massless, except for a mass M located at its center (see Fig. 5.21). 
The wheel rolls without slipping on a horizontal table. Find the equation of 
motion for the angle through which the wheel rolls. For the case where the 
wheel undergoes small oscillations, find the frequency. 

15. Double pendulum **** 

Consider a double pendulum made of two masses, m\ and m 2 , and two rods 
of lengths l\ and £2 (see Fig. 5.22). Find the equations of motion. 

For small oscillations, find the normal modes and their frequencies for the 
special case £\ = £2 (and consider the cases mi = m 2 , mi > m 2 , and m\ <C 
m 2 ). Do the same for the special case m\ = m 2 (and consider the cases £\ = £ 2 , 
£\ £ 2 , and l\ <C £ 2 ). 



Figure 5.21 





V-32 


CHAPTER 5. THE LAGRANGIAN METHOD 



Figure 5.23 



Figure 5.24 



Figure 5.25 



Figure 5.26 


16. Pendulum with a free support ** 

A mass M is free to slide along a frictionless rail. A pendulum of length £ and 
mass m hangs from M (see Fig. 5.23). Find the equations of motion. For 
small oscillations, find the normal modes and their frequencies. 

17. Pendulum support on an inclined plane ** 

A mass M is free to slide down a frictionless plane inclined at angle /3. A 
pendulum of length £ and mass m hangs from M (see Fig. 5.24). Find the 
equations of motion. For small oscillations, find the normal modes and their 
frequencies. 

18. Tilting plane *** 

A mass M is fixed at the right-angled vertex where a massless rod of length 
£ is connected to a very long massless rod (see Fig. 5.25). A mass m is free 
to move frictionlessly along the long rod. The rod of length £ is hinged at a 
support, and the whole system is free to rotate, in the plane of the rods, about 
the support. 

Let 9 be the angle of rotation of the system, and let x be the distance between 
m and M. Find the equations of motion. Find the normal modes when 9 and 
x are both very small. 

19. Motion in a cone *** 

A particle slides on the inside surface of a frictionless cone. The cone is fixed 
with its tip on the ground and its axis vertical. The half-angle at the tip is a 
(see Fig. 5.26). Let r(t) be the distance from the particle to the axis, and let 
9(t) be the angle around the cone. Find the equations of motion. 

If the particle moves in a circle of radius Tq , what is the frequency, to, of this 
motion? If the particle is then perturbed slightly from this circular motion, 
what is the frequency, f2, of the oscillations about the radius ro? Under what 
conditions does U = oft 

Section 5.8: Other applications 

20. Shortest distance in a plane 

In the spirit of Section 5.8, show that the shortest path between two points in 
a plane is a straight line. 



Figure 5.27 


21. Index of refraction ** 

Assume that the speed of light in a given slab of material is proportional to 
the height above the base of the slab. 13 Show that light moves in circular arcs 
in this material; see Fig. 5.27. You may assume that light takes the path of 
least time between two points (Fermat’s principle of least time). 

13 In other words, the index of refraction of the material, n, as a function of the height, y, is given 
by n(y) = yo/y, where yo is some length that is larger than the height of the slab. 
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22. The brachistochrone *** 

A bead is released from rest at the origin and slides down a frictionless wire 
that connects the origin to a given point, as shown in Fig. 5.28. You wish to 
shape the wire so that the bead reaches the endpoint in the shortest possible 
time. Let the desired curve be described by the function y(x), with downward 
taken to be positive. Show that y(x ) satisfies 

1 + y' 2 = - . (5.94) 

y 

where C is a constant. Show that x and y may be written as 

x = a(0 — sin0), y = a(l — cosO). (5.95) 

This is the parametrization of a cycloid, which is the path taken by a point 
on the edge of a rolling wheel. 

23. Minimal surface ** 

Derive the shape of the minimal surface discussed in Section 5.8, by demanding 
that a cross-sectional “ring” (that is, the region between the planes x = x\ 
and x = X 2 ) is in equilibrium; see Fig. 5.29. Hint: The tension must be 
constant throughout the surface. 

24. Existence of a minimal surface ** 

Consider the minimal surface from Section 5.8, and look at the special case 
where the two rings have the same radius (see Fig. 5.30). Let 2£ be the distance 
between the rings. What is the largest value of i/r for which a minimal surface 
exists? You will have to solve something numerically here. 



y 

Figure 5.28 


ir1\ 





Figure 5.29 



Figure 5.30 
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Figure 5.31 


5.11 Solutions 


1. Moving plane 

Let xi be the horizontal coordinate of the plane (with positive x-\ to the left), and 
let x -2 be the horizontal coordinate of the block (with positive x 2 to the right); see 
Fig. 5.31. The relative horizontal distance between the plane and the block is xi+x 2 , 
so the height fallen by the block is (aq + X 2 ) tan #. The Lagrangian is therefore 

L = ^Mi; 2 + + (iq + x 2 ) 2 tan 2 #^ + mg(x 1 + x 2 ) tan#. (5.96) 

The equations of motion obtained from varying xi and X 2 are 

Mxi + m(xi + x 2 ) tan 2 # = mg tan 6 , 

mx 2 + m(aq + x 2 ) tan 2 # = mg tan#. (5.97) 


Note that the difference of these two equations immediately yields conservation of 
momentum, Mx 1 — mx 2 = 0 => {d/dt){Mx 1 — mx 2 ) = 0. Eqs. (5.97) are two linear 
equations in the two unknowns, x-\ and x 2 , so we can solve for aq. After a little 
simplification, we arrive at 


mg sin # cos # 
M + to sin 2 # 


(5.98) 


For some limiting cases, see the remark in the solution to Problem 2.2. 


2. Two masses, one swinging 

With r and # being the distance from the swinging mass to the pulley, and the angle 
of the swinging mass, respectively, the Lagrangian is 


L 


-m(r 2 + r 2 # 2 ) — mgr + mgr cos #. 


(5.99) 


The last two terms are the (negatives of the) potentials of each mass, relative to where 
they would be if the right mass were located at the right pulley. The equations of 
motion obtained from varying r and # are 

2 r = r# 2 — g(l — cos #), 

^(r 2 #) = —grsin#. (5.100) 


The first equation deals with the forces and accelerations along the direction of the 
string. The second equation equates the torque from gravity with the change in 
angular momentum of the right mass. 

If we do a (coarse) small-angle approximation and keep only terms up to first order 
in #, we find that at t = 0 (using the initial condition, r = 0), eqs. (5.100) become 


r = 0, 
- -# = 0. 


(5.101) 


These say that the left mass stays still, and the right mass behaves just like a pendu¬ 
lum. 

If we want to find the leading term in the initial acceleration of the left mass (that 
is, the leading term in r ), we need to be a little less coarse in our approximation. So 
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let’s keep terms in eq. (5.100) up to second order in 0. We then have at t = 0 (using 
the initial condition, r = 0 ) 


2 f = rO 2 — -g 6 2 , 

2 y 

0 + -0 = 0. (5.102) 

The second equation still says that the right mass undergoes harmonic motion. We 
are told that the amplitude is e, so we have 

9(t) = ecos(wt + <t>), (5.103) 

where uj = \Jg/r. Plugging this into the first equation gives 

2r = e 2 g ^sin 2 (wt + 4>) — ^ cos 2 (ut + <p)J . (5.104) 

If we average this over a few periods, both sin 2 a and cos 2 a average to 1/2, so we 
find 



This is a small second-order effect. It is positive, so the left mass slowly begins to 
climb. 

3. Two falling sticks 

Let 6 \{t) and 0 - 2 (t) be defined as in Fig. 5.32. Then the position of the bottom 
mass in cartesian coordinates is (rsinf^rcosdi), and the position of the top mass is 
(2rsin0! — r sin d 2 ■ 2r cos 0\ + r cos 62 ). So the potential energy of the system is 

V( 01 , 62 ) = mgr (3 cos 0\ + cos 62 ). (5.106) 

The kinetic energy is somewhat more complicated. The kinetic energy of the bottom 
mass is simply mr 2 6 \/ 2. Taking the derivative of the top mass’s position given above, 
we find that the kinetic energy of the top mass is 

^mr 2 ((2 cos 0 A - cos M 2) 2 + (-2 sin 0 A - sin M 2 ) 2 ) ■ (5.107) 

We can simplify this, using the small-angle approximations. The terms involving 
sin# will be fourth order in the small O' s, so we may neglect them. Also, we may 
approximate cos 0 by 1, because this entails dropping only terms of at least fourth 
order. So the top mass’s kinetic energy becomes (l/2)rnr 2 (20i — f^) 2 - In retrospect, it 
would have been easier to obtain the kinetic energies of the masses by first applying the 
small-angle approximations to the positions, and then taking the derivatives to obtain 
the velocities. This strategy will show that both masses move essentially horizontally 
(initially). You will probably want to use this strategy when solving Exercise 1. 
Using the small-angle approximation cos 0 ~ 1 — 0 2 /2 to rewrite the potential energy 
in eq. (5.106), we have 

L « l -mr 2 (5 0\ - 4M 2 + 0 2 ) - mgr (4 - ^0 2 - ^ . 



Figure 5.32 


(5.108) 
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x(t) 



Figure 5.33 


The equations of motion obtained from varying 61 and 0 2 are, respectively, 

50i - 202 = —0i 

-201 +02 = -@ 2 - (5.109) 

At the instant the sticks are released, we have 0i = 0 and 02 = e. Solving eqs. (5.109) 
for 0 i and 0 2 gives 

0i= 2 y, and h = (5.110) 

4. Pendulum with an oscillating support 

Let 0 be defined as in Fig. 5.33. With x(t ) = Acos(wf), the position of the mass m 
is given by 

(X,Y) m = (x + £ sin <?,-.£ cos 0). (5.111) 

The square of the speed is 

V 2 = £ 2 0 2 + x 2 + 21x0 cos 0. (5.112) 


The Lagrangian is therefore 

L = ^ m(i 2 0 2 + x 2 + 21x0 cos 0) + mg£ cos 0. (5.113) 

The equation of motion obtained from varying 0 is 

^(m £ 2 0 + mix cos, 0 ) = — m£x 0 sin 0 — mg£ sin 0 

=4* £0 + x cos 0 = -g sin (9. (5.114) 

Plugging in the explicit form of x(t), we have 

£0 — Au ) 2 cos (cut) cos 0 + g sin 0 = 0. (5.115) 


This makes sense. Someone in the frame of the support (which has horizontal acceler¬ 
ation x = —Au ) 2 cos(ajt)) may as well be living in a world where the acceleration from 
gravity has a component g downward and a component Au 2 cos(wf) to the right. Eq. 
(5.122) is simply the F = ma equation in the tangential direction in this accelerating 
world. 

A small-angle approximation in eq. (5.115) gives 

0 + Wq0 = aJ 2 cos(wf), (5.116) 

where wo = \fg/£ and a = A/£. This equation is simply that of a driven oscillator, 
which we solved in Chapter 3. The solution is 


0(t) = ~2 -+cps(uit) + Ccos(u 0 t + <f)), (5.117) 

u 0 —u 


where C and cj> are determined by the initial conditions. 

If w happens to equal ui( h then the amplitude becomes large. Eq. (5.117) would seem 
to suggest that the amplitude actually goes to infinity in this case. But as soon as 
the amplitude becomes large, our small-angle approximation breaks down, and eqs. 
(5.116) and (5.117) are no longer valid. 
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5. Inverted pendulum 

Let 9 be defined as in Fig. 5.34. With y(t) = A cos(ujt), the position of the mass m 
is given by 

(X,Y) = (£smO,y + £cos9). (5.118) 

Taking the derivatives of these coordinates, we see that the square of the speed is 

V 2 = X 2 + Y 2 = £ 2 9 2 + y 2 — 2£y9 sin 9. (5.119) 

The Lagrangian is therefore 

L = ^ m(£ 2 9 2 + y 2 — 2£y9sin9 ) — mg(y + cos 9). (5.120) 

The equation of motion for 9 is 

S(S) = S iii — ysinO = gsinO. (5.121) 

Plugging in the explicit form of y(t), we have 

£9 + sin 9(^Aiv 2 cos(wi) ~ o) = 0- (5.122) 

In retrospect, this makes sense. Someone in the reference frame of the support, 
which has acceleration y = —Au> 2 cos(ajt), may as well be living in a world where the 
acceleration from gravity is g — Aj 2 cos(wi) downward. Eq. (5.122) is simply the 
F = ma equation in the tangential direction in this accelerated frame. 

Assuming 9 is small, we may set sin# « 9, which gives 

9 + 9(auj 2 cos(u>t) - = 0, (5.123) 

where u> 0 = y/g /£, and a = A/l. Eq. (5.123) cannot be solved exactly, but we can 
still get a good idea of how 9 depends on time. We can do this both numerically and 
(approximately) analytically. 

The figures below show how 9 depends on time for parameters with values l = lm, 
A = 0.1m, and g = 10m/s 2 (so a = 0.1, and Uq = 10s -2 ). In the first plot, 
u = 10 s -1 . And in the second plot, u = 100 s -1 . The stick falls over in first case, 
but undergoes oscillatory motion in the second case. Apparently, if u is large enough 
the stick will not fall over. 



Figure 5.34 


theta 
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Let’s now explain this phenomenon analytically. At first glance, it’s rather surprising 
that the stick stays up. It seems like the average (over a few periods of the w oscil¬ 
lations) of the tangential acceleration in eq. (5.123), namely — 9(au 2 cos(wf) — u)q), 
equals the positive quantity Ooj'q , because the cos(c ot) term averages to zero (or so it 
appears). So you might think that there is a net force making 9 increase, causing the 
stick fall over. 

The fallacy in this reasoning is that the average of the — au 2 9cos(ut) term is not zero, 
because 9 undergoes tiny oscillations with frequency u, as seen below. Both of these 
plots have a = 0.005, cog = 10 s -2 , and u> = 1000 s -1 (we’ll work with small a and 
large co from now on; more on this below). The second plot is a zoomed-in version of 
the first one near t = 0. 


theta theta 



The important point here is that the tiny oscillations in 9 shown in the second plot 
are correlated with cos(wf). It turns out that the 9 value at the t where cos(wf) = 1 
is larger than the 9 value at the t where cos(wf) = -1. So there is a net negative 
contribution to the —au) 2 9cos(cot) part of the acceleration. And it may indeed be 
large enough to keep the pendulum up, as we will now show. 

To get a handle on the —aui 2 9 cos(u>t) term, let’s work in the approximation where u> 
is large and a = A/i is small. More precisely, we will assume 1 and cluj 2 » o)q, 
for reasons we will explain below. Look at one of the little oscillations in the second of 
the above plots. These oscillations have frequency u>, because they are due simply to 
the support moving up and down. When the support moves up, 9 increases; and when 
the support moves down, 9 decreases. Since the average position of the pendulum 
doesn’t change much over one of these small periods, we can look for an approximate 
solution to eq. (5.123) of the form 

9(t) w C + bcos(ut), (5.124) 

where b -C C. C will change over time, but on the scale of 1 /w it is essentially 
constant, if a = A/t is small enough. 

Plugging this guess for 9 into eq. (5.123), and using o«l and a;J 2 » Uq, we find 
that — boj 2 cos (cot) + Cauj 2 cos (u>t) = 0, to leading order. 14 So we must have b = aC. 

14 The reasons for the a <C 1 and aw 2 u>o qualifications are the following. If aw 2 then 

the aw 2 cos(urf) term dominates the u>o term in eq. (5.123). The one exception to this is when 
cos(uit) « 0, but this occurs for a negligibly small amount of time if aoj 2 If a 1, then 

we can legally ignore the C term when eq. (5.124) is substituted into eq. (5.123). We will find 
below, in eq. (5.126), that our assumptions lead to C being roughly proportional to a 2 u) 2 . Since 
the other terms in eq. (5.123) are proportional to auj 2 , we need a <C 1 in order for the C term to be 
negligible. In short, a -C 1 is the condition under which C varies slowly on the time scale of 1/u;. 
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Our approximate solution for 9 is therefore 

6 « C^l + acos(u;t)^. 


(5.125) 


Let’s now determine how C gradually changes with time. From eq. (5.123), the 
average acceleration of 9, over a period T = is 


9 = —9 (au> 2 cos(cjt) — 

— c(l + acos(ut)j (au> 2 cos(wt) — Wq) 
= — C (a 2 <jj 2 cos 2 (cJt) — Wq) 



= -Cfl 2 , (5.126) 


where 


ft 


9 
i ‘ 


(5.127) 


But if we take two derivatives of eq. (5.124), we see that 9 simply equals C. Equating 
this value of 9 with the one in eq. (5.126) gives 


(7(f) + Cl 2 C(t) w 0. 


(5.128) 


This equation describes nice simple-harmonic motion. Therefore, C oscillates sinu¬ 
soidally with the frequency given in eq. (5.127). This is the overall back and forth 
motion seen in the first of the above plots. Note that we must have auj > V2u)o if 
this frequency is to be real so that the pendulum stays up. Since we have assumed 
a C 1 , we see that a 2 oj 2 > 2u>q implies au> 2 uj'q, which is consistent with our initial 
assumption above. 

If aui cl-'o, then eq. (5.127) gives Q ~ au)/V 2. Such is the case if we change the setup 
and simply have the pendulum lie flat on a horizontal table where the acceleration 
from gravity is zero. In this limit where g is irrelevant, dimensional analysis implies 
that the frequency of the C oscillations must be a multiple of u>, because oj is the only 
quantity in the problem with units of frequency. It just so happens that the multiple 
is a/V2. 

6. Minimum or saddle 


(a) For the given £(t), the integrand in eq. (5.25) is symmetric around the midpoint, 
so we obtain 


AS = 



(5.129) 


This is negative if T > y/l2m/k = 2\/3/u>. Since the period of the oscillation is 
r = 27 t/uj, we see that T must be greater than (\/3/7r )r in order for AS to be 
negative (provided that we are using our triangular function for £). 
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(b) With £(f) = esm(nt/T), the integrand in eq. (5.25) becomes 
AS = 


where we have used the fact that the average value of sin 2 9 and cos 2 9 over 
half of a period is 1/2 (o r you can just do the integrals). This result for A S is 
negative if T > n^/m/k = tt/cj = r/2, where r is the period. 

Remark: It turns out that the £(t) oc sin(7r t/T) function gives the best chance of 
making AS negative. You can show this by invoking a theorem from Fourier analysis 
that says that any function satisfying £(0) = £(T) = 0 can be written as the sum 
£(t) = c n sin(mrt/T), where the c n are numerical coefficients. When this sum is 
plugged into eq. (5.25), you can show that all the cross terms (terms involving two 
different values of n) integrate to zero. Using the fact that the average value of sin 2 9 
and cos 2 9 is 1/2, the rest of the integral yields 

A5 = (5-131) 

In order to obtain the smallest value of T that can make this sum negative, we want 
only the n = 1 term to exist. We then have £(t) is ci sin(7rt/T), and eq. (5.131) 
reduces to eq. (5.130), as it should. 

As mentioned in Remark 4 in Section 5.2, it is always possible to make AS positive by 
picking a £(f) function that is small but wiggles very fast. Therefore, we see that for a 
harmonic oscillator, if T > r/2, then the stationary value of S is a saddle point (some 
£’s make AS positive, and some make it negative), but if T < t/2, then the stationary 
value of S is a minimum (all £’s make AS positive). In the latter case, the point is 
that T is small enough so that there is no way for £ to get large, without making £ 
large also. £ 

7. Normal force from a plane 

First Solution: The most convenient coordinates in this problem are w and z, 
where w is the distance upward along the plane, and z is the distance perpendicularly 
away from it. The Lagrangian is then 

^ m(w 2 + z 2 ) — mg(w sin 9 + zcos9) — V(z), (5.132) 

where V(z) is the (very steep) constraining potential. The two equations of motion 
are 


1 r 

2 Jo 


-cos (nt/T) 


- fc(esin( 7 rf/T’)) j i 


(5.130) 


m-w = —mg sin 9, 

dV 

mz = —mgcos9——. (5.133) 

At this point we invoke the constraint 2 = 0. So z = 0, and the second equation gives 
F c = — F'(0) = mg cos 9, (5.134) 

as desired. We also obtain the usual result, w = —g sin 0. 
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Second Solution: We can also solve this problem by using the horizontal and 
vertical components, x and y. We’ll choose (x,y) = (0,0) to be at the top of the 
plane; see Fig. 5.35. The (very steep) constraining potential is V(z), where z = 
a;sin# + y cos# is the distance from the mass to the plane (as you can verify). The 
Lagrangian is then 

L = *m(x 2 + y 2 )- mgy - V(z) (5.135) 

Keeping in mind that z = x sin 9 + y cos #, the two equations of motion are (using the 
chain rule) 

dV dz . 

mx = —— — = — V (z) sin#, 

dz ox 

my = —mg — —j— — = —mg — V'{z) cos #. (5.136) 

At this point we invoke the constraint condition x = —y cot # (that is, 2 = 0). This 
condition, along with the two E-L equations, allows us to solve for the three unknowns, 
x, y, and U'(0). Using x = —y cot# in eqs. (5.136), we find 

x = g cos# sin#, y = — gsin 2 #, F c =—V'(0) = mg cos 9. (5.137) 


The first two results here are simply the horizontal and vertical components of the 
acceleration along the plane. 

8. Bead on a stick 

There is no potential energy here, so the Lagrangian simply consists of the kinetic 
energy, which comes from the radial and tangential motions: 



Eq. (5.52) therefore gives 

E = ^ mr 2 - l -mr 2 u 2 . (5.139) 

Claim 5.3 says that this quantity is conserved, because dL/dt = 0. But it is not the 
energy of the bead, due to the minus sign in the second term. 

The point here is that in order to keep the stick rotating at a constant angular speed, 
there must be an external force acting it. This force will cause work to be done on 
the bead, thereby changing its kinetic energy. The kinetic energy, T, is therefore not 
conserved. From the above equations, we see that E = T — mr 2 u> 2 is the quantity 
that is constant in time. 

See Exercise 10 for some F = ma ways to show that the quantity E is conserved. 

9. Atwood’s machine 

First solution: If the left mass goes up by x and the right mass goes up by y, then 
conservation of string says that the middle mass must go down by x + y. Therefore, 
the Lagrangian of the system is 


L 


^( 4 m)x 2 + ^(3 m)(-x - y) 2 + ^ my 2 - ((4 m)gx + (3m)g(-x - y) + mgy ) 
^-mx 2 + 3 mxy + 2 my 2 - mg(x - 2 y). 



Figure 5.35 


(5.140) 
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This is invariant under the transformation x —> x + 2e and y —> y + e. Hence, we can 
use Noether’s theorem, with K x = 2 and K y = 1. The conserved momentum is then 

P = = TO ( 7 ® + 3y)(2) + m(3x + 4y)(l) = m(17x + lOy). (5.141) 

This P is constant. In particular, if the system starts at rest, then x always equals 
—(10/17 )y. 

Second solution: The Euler-Lagrange equations are, from eq. (5.140), 



Figure 5.36 


7 mx + 3my = —mg, 

3 mx + Amy = 2 mg. (5.142) 

Adding the second equation to twice the first gives 

17rmr + 10my = 0 =>■ ^ ^17mi; + lOmy^ = 0. (5.143) 

Third solution: We can also solve this problem using F = ma. Since the tension, 
T, is the same throughout the rope, we see that the three F = dP/dt equations are 

2T - im <’ = d ¥- 2T - 3m <’ = d ¥’ < 5144 » 

The three forces depend on only two parameters, so there will be some combination 
of them that adds up to zero. If we set a(2T — 4 mg) + b(2T — 3mg) + c(2T — mg) = 0, 
then we have a + b + c = 0 and 4a + 36 + c = 0, which is satisfied by a = 2, b = —3, 
and c = 1. Therefore, 

0 = f t (2P4m-3P 3m + P m ) 

= ^ ^2(4m)i: - 3(3m) (-a; - y) + my ) 

= ^-(17mx + lOmy). (5.145) 

dt 

10. Hoop and pulley 

Let the radius to M make an angle 9 with the vertical (see Fig. 5.36). Then the 
coordinates of M are 7?(sin 9, — cos 6). The height of to, relative to its position when 
M is at the bottom of the hoop, is y = —R9. The Lagrangian is therefore (and yes, 
we’ve chosen a different y = 0 point for each mass, but such a definition only changes 
the potential by a constant amount, which is irrelevant) 

L= ^(M + m)R 2 6 2 + MgRcos6 + mgR0. (5.146) 

The equation of motion is then 

(M + m)R9 = g(m-Msm9). (5.147) 

This is, of course, just F = ma along the direction of the string (because Mg sin 9 is 
the tangential component of the gravitational force on M). 
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Equilibrium occurs when 0 = 0 = 0. From eq. (5.147), we see that this happens at 
sin 9q = m/M. Letting 0 = 0q + 8, and expanding eq. (5.147) to first order in 8, gives 


1+ 


( Mg cos d 0 
\(M + m)RJ 


(5 = 0 . 


The frequency of small oscillations is therefore 




M cos 0 o 
M + m 


[J 

V R 


I M — m \ 1/4 IJ 
\M + m) \/r 


(5.148) 


(5.149) 


where we have used cos 0q = \J 1 — sin#§. 


Remarks: If M > m, then 0o « 0, and u> w y/g/R. This makes sense, because m can be 
ignored, and M essentially oscillates about the bottom of the hoop, just like a pendulum of 
length R. 

If M is only slightly greater than m, then 0o « tt/2, and u> « 0. This also makes sense, 
because if 0 « 7t/2, the restoring force g(m — M sin 0) does not change much as 6 changes 
(the derivative of sin# is zero at 0 = 7r/2), so it’s as if we have a pendulum in a weak 
gravitational field. 

We can actually derive the frequency in eq. (5.149) without doing any calculations. Look at 
M at the equilibrium position. The tangential forces on it cancel, and the radially inward 
force from the hoop must be Mg cos 0o to balance the radial outward component of the 
gravitational force. Therefore, for all the mass M knows, it is sitting at the bottom of a 
hoop of radius R in a world where gravity has strength g' = g cos Oq. The general formula for 
the frequency of a pendulum (as you can quickly show) is u> = yjF'/M'R, where F' is the 
gravitational force (which is Mg' here), and M' is the total mass being accelerated (which 
is M + m here). This gives the ui in eq. (5.149). & 

11. Bead on a rotating hoop 

Let 0 be the angle that the radius to the bead makes with the vertical (see Fig. 5.37). 
Breaking the velocity up into the component along the hoop plus the component 
perpendicular to the hoop, we find 

L = ^ m(uj 2 R 2 sin 2 0 + R 2 0 2 ) + mgRcosO. (5.150) 

The equation of motion is then 

R0 = sin 0(co 2 RcosO-g). (5.151) 


The F = ma interpretation of this is that the component of gravity pulling downward 
along the hoop accounts for the acceleration along the hoop plus the component of 
the centripetal acceleration along the hoop. 

Equilibrium occurs when 0 = 0= 0. The right-hand side of eq. (5.151) equals zero 
when either sin# = 0 (that is, 0 = 0 or 0 = n) or cos 0 = g/(uj 2 R). Since cos 0 must 
be less than or equal to 1, this second condition is possible only if uj 2 > g/R. So we 
have two cases: 

• If u) 2 < g/R, then 0 = 0 and 0 = -rr are the only equilibrium points. 

The 0 = 7T case is unstable. This is fairly intuitive, but it can also be seen 
mathematically by letting 0 = n + 6, where 8 is small. Eq. (5.151) then becomes 

8 - S(u 2 + g/R ) = 0. 





Figure 5.37 


(5.152) 
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m 



Figure 5.38 


The coefficient of 5 is negative, so this does not admit oscillatory solutions. 

The 0 = 0 case turns out to be stable. For small 0, eq. (5.151) becomes 

0 + 0(g/R - tu 2 ) = 0. (5.153) 

The coefficient of 0 is p ositive, so we have sinusoidal solutions. The frequency 
of small oscillations is y/ g/R— u> 2 . This goes to zero asw-» \Jg/R. 

• If u> 2 > g/R, then 0 = 0, 0 = n, and cos 0 o = g/(oj‘ 2 R) are all equilibrium points. 
The 0 = 7r case is again unstable, by looking at eq. (5.152). And the 0 = 0 case 
is also unstable, because the coefficient of 0 in eq. (5.153) is now negative (or 
zero, if J 1 = g/R). 

Therefore, cos 0o = g/ (w 2 f?) is the only stable equilibrium. To find the frequency 
of small oscillations, let 0 = 0o + S in eq. (5.151), and expand to first order in 
5. Using cos0 0 = g/(u> 2 R), we find 

6 + u) 2 sin 2 0 o S = 0. (5.154) 

The frequency of small oscillations is therefore u sin 0o = y/w 2 — g 2 /R 2 to 2 . 

Remark: This frequency goes to zero asw-> y/ g/R. And it approximately equals 
u) as ui —> oo. This second limit can be viewed in the following way. For very large 
uj, gravity is not very important, and the bead essentially feels a centripetal force of 
muj 2 R as it moves near 6 = 7r/2. So for all the bead knows, it is a pendulum of length 
R in a world where “gravity” pulls sideways with a force mupR = mg'. The frequency 
of such a pendulum is y /g'/R = u> 2 R/R = w. ft 

The frequency u =>.' 4 @t;g/R is the critical frequency above which there is a stable 
equilibrium at 0 A 0, that is, above which the mass will want to move away from the 
bottom of the hoop. 

12. Another bead on a rotating hoop 

Let the angles lot and 0 be defined as in Fig. 5.38. Then the cartesian coordinates 
for the bead are 

(x, y) = ^Rcosut + r cos(uit + 0), Rsiruvt + r sin(u;f + 0)j. (5.155) 

The velocity is then 

(x,y) = (^ —ujR sinujt-r(uj+0)sm(ijjt+0), u>Rcosu>t+r{w+0) cos(iuf + 0)^. (5.156) 
The square of the speed is therefore 
v 2 = R 2 u> 2 + r 2 {u> + 0) 2 

+2Rno(ui + 0) ^ smut sin(wf + 0) + cos uit cos(wf + 0)J 
= R 2 uj 2 +r 2 (u + Of + 2 Rru>(u + 0) cos 0 (5.157) 

There is no potential energy, so the Lagrangian is simply L = mv 2 / 2. The equation 
of motion is then, as you can show, 

rO + Rlo 2 sin# = 0. (5.158) 

Equilibrium occurs when 0 = 0 = 0, and so eq. (5.158) tells us that the equilibrium 
is located at 0 = 0, which makes intuitive sense. (Another solution is 0 = n, but 
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that’s an unstable equilibrium.) A small-angle approximation in jsq. (5.158) gives 
6 + (R/r)ui 2 Q = 0, so the frequency of small oscillations is il = ui^R/r. 


Remarks: If R <C r, then f! « 0. This makes sense, because the frictionless hoop is 
essentially not moving. If R = r, then fi = u. If R 'Jl> r, then O is very large. In this case, 
we can double-check the fl = uiyjR/r result in the following way. In the accelerating frame 
of the hoop, the bead feels a centrifugal force (discussed in Chapter 9) of m(R + r)u> 2 . For 
all the bead knows, it is in a gravitational field with strength g' = (R + r)u> 2 . So the bead 
(which acts like a pendulum of length r), oscillates with a frequency equal to 


^/7_ ^ (r+7v^ _^ 


(5.159) 


for R > r. 

Note that if we try to use this “effective gravity” argument as a double check for smaller 
values of R, we get the wrong answer. For example, if R = r, we obtain an oscillation 
frequency of uty/zR/r, instead of the correct value a >^jR/r. This is because in reality the 
centrifugal force fans out near the equilibrium point, while our “effective gravity” argument 
assumes that the field lines are parallel (and so it gives a frequency that is too large). Jl» 

13. Rotating curve 

The speed along the curve is x\J 1 + y' 2 , and the speed perpendicular to the curve is 
wx. So the Lagrangian is 

L = ^m( W ¥ + x 2 (l + y' 2 )) - mgy , (5.160) 

where y(x) = b(x/a) x . The equation of motion is then 


d_ 

dt\dx) dx 


x(l + y' 2 ) + x 2 y'y" = ui 2 x - gy'. (5.161) 


Equilibrium occurs when x 
of x satisfies 


x = 0, so eq. (5.161) implies that the equilibrium value 


x 0 


9V'(x q ) 


(5.162) 


The F = ma explanation for this is that the component of gravity along the curve 
accounts for the component of the centripetal acceleration along the curve. Using 
y(x) = b(x/a) x , eq. (5.162) yields 


/ 2 2X 1/( A-2) 

x„ = a( —) . (5.183) 

As A —> oo, we see that x 0 goes to a. This makes sense, because the curve essentially 
equals zero up to a, and then it rises very steeply. You can check numerous other 
limits. 

Letting x = xo + S in eq. (5.161), and expanding to first order in gives 

<5(l + y'ixo) 2 ^ = 5(u 2 -gy"(x 0 )y (5.164) 

The frequency of small oscillations is therefore 

o2 _ 9V"(x 0) - ¥ 

l + y'(xo ) 2 


(5.165) 
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Figure 5.39 


Using the explicit form of y, along with eq. (5.163), we find 


Cl 2 


(A - 2)u> 2 


(5.166) 


We see that A must be greater than 2 in order for there to be oscillatory motion 
around the equilibrium point. For A < 2, the equilibrium point is unstable, that is, 
to the left the force is inward, and to the right the force is outward. 

For the case A = 2, the equilibrium condition, eq. (5.162), gives Xq = ( 2gb/a 2 u> 2 )xo. 
For this to be true for some xo, we must have ui 2 = 2gb/a 2 . But if this holds, then 
eq. (5.162) is true for all x. So in the special case of A = 2, the bead will happily 
sit anywhere on the curve if u 2 = 2gb/a 2 . (In the rotating frame of the curve, the 
tangential components of the centrifugal and gravitational forces exactly cancel at all 
points.) If ui 2 A 2gb/a 2 , then the particle feels a force either always inward or always 
outward. 

Remarks: For u> —> 0, eqs. (5.163) and (5.166) give xo —■> 0 and 12 —> 0. And for u> —> oo, 
they give xo —*• oo and 12 —> 0. In both cases 12 —» 0, because in both case the equilibrium 
position is at a place where the curve is very flat (horizontally or vertically, respectively), so 
the restoring force is very small. 

For A —> oo, we have xo —>• a and 12 —> oo. The frequency is large here because the equilibrium 
position at a is where the curve has a sharp corner, so the restoring force changes quickly with 
position. Or, you can think of it as a pendulum with a very small length (if you approximate 
the “corner” by a tiny circle). & 

14. Mass on a wheel 

Let the angle 0 be defined as in Fig. 5.39, with the convention that 0 is positive if 
M is to the right of to. Then the position of rn in cartesian coordinates, relative to 
the point where to would be in contact with the ground, is 

(x,y) m = R(9- sin#, 1-cos#). (5-167) 

We have used the non-slipping condition to say that the present contact point is a dis¬ 
tance R6 to the right of where to would be in contact with the ground. Differentiating 
eq. (5.167), we find that the square of to’ s speed is v ^ = 2R 2 0 2 (1 — cos #). 

The position of M is (x. y)M = R{0, 1), so the square of its speed is v 2 M = R 2 0 2 . The 
Lagrangian is therefore 

L = ^MR 2 9 2 + mR 2 9 2 (1 — cos 6) + mgR cos 0, (5.168) 

where we have measured both potential energies relative to the height of M. The 
equation of motion is 

MR 2 6 + 2mR 2 9(l — cos#) + mR 2 9 2 sin# + mgRsm.9 = 0. (5.169) 

In the case of small oscillations, we may use cos# ~ 1 — 9 2 /2 and sin# rts 9. The 
second and third terms above are third order in # and may be neglected, so we find 



(5.170) 
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The frequency of small oscillations is therefore 




(5.171) 


Remarks: If M > m, then u> —> 0. This makes sense. 

If m > M, then io —> oo. This also makes sense, because the huge mg force makes the 
situation similar to one where the wheel is bolted to the floor, in which case the wheel 
vibrates with a high frequency. 

Eq. (5.171) can actually be derived in a much quicker way, using torque (which will be 
discussed in Chapter 7). For small oscillations, the gravitational force on m produces a 
torque of —mgRO around the contact point on the ground. For small 0, m has essentially no 
moment of inertia around the contact point, so the total moment of inertia is simply MR?. 
Therefore, r = la gives —mgR6 = MR 2 0, from which the result follows. X 
15. Double pendulum 

Relative to the pivot point, the cartesian coordinates of mi and m 2 are, respectively 
(see Fig. 5.40), 


(x,y) 1 = (4sinl?i,—4cos0i), 

(x,y) 2 = (£1 sin#i + £2 sin0 2 , — 1\ cos 0i — (.2 cos0 2 ). (5.172) 

Taking the derivative to find the velocities, and then squaring, gives 

4 = £\el 

v\ = t\6\ + £le% + 2^4M 2 (cos 6 X cos 0 2 + sin64 sin0 2 ). (5.173) 

The Lagrangian is therefore 

L = I mi t\b\ + (ifel + £\e\ + 2£ 1 £ 2 e 1 e 2 cos(0i - 0 2 )) 

+m\g£\ cos + m 2 g(£i cos0i + £2 cos 0 2 )- (5.174) 

The equations of motion obtained from varying 9 1 and 0 2 are 

0 = (mi + m 2 )^f(9i + m 2 ^i^ 2 02 cos( 0 i - 0 2 ) + m 2 ^i ^ 2 02 sin(0i - 0 2 ) 

+(mi + m 2 )^i sin 0i, 

0 = m 2 ^0 2 + m 2 ^i^ 2 0i cos(0i - 0 2 ) - m 2 £i£ 20 l sin(0i - 0 2 ) 

+rn 2 <jff 2 sin 0 2 . (5.175) 

This is a bit of a mess, but it simplifies greatly if we consider small oscillations. Using 
the small-angle approximations and keeping only the leading-order terms, we obtain 

0 = (mi + m 2 )^i0i + m 2 £ 2 0 2 + (mi + m 2 ) 50 i, 

0 = £202 + £iOi + g9 2 . (5.176) 



Figure 5.40 


Consider now the special case, £ x = £2 = £■ We can find the frequencies of the normal 
modes by using the determinant method, discussed in Section 3.5. You can show that 

the result is r _ 

mi + m 2 ± - v /mirn 2 + ml Lg 
m1 ]/£' 


lo± = 


(5.177) 
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The normal modes are found to be, after some simplification, 


h(t) 

h(t) 


= F^/TO2 
yjm\ + m2 


COS (w±t+ <j)±). 


Some special cases are: 

• mi = m 2 : The frequencies are 


u>± = 


The normal modes are 


hit) 
h(t ) 


Tl 

V2 


COs(w ± t +'4afe)- 


(5.178) 


(5.179) 


(5.180) 


• mi > m 2 : With m 2 /mi = e, the frequencies are (to leading nontrivial order in 

4 

w ± = (l±Vi/2)^. ( 5 . 181 ) 

The normal modes are 

( h® ) ± = ( ) cos ^ t + ^- ( 5 ‘ 182 ) 

In both modes, the upper (heavy) mass essentially stands still, and the lower 
(light) mass oscillates like a pendulum of length l. 

• mi C m 2 : With m\/m 2 = e, the frequencies are (to leading order in e) 

# <5 ' 183) 

The normal modes are 

(Sw) ± = (?) cos( “ ±f+0±) ' <5184) 

In the first mode, the lower (heavy) mass essentially stands still, and the upper 
(light) mass vibrates back and forth at a high frequency (because there is a very 
large tension in the rods). In the second mode, the rods form a straight line, 
and the system is essentially a pendulum of length 21. 



Consider now the special case, mi = m 2 . Using the determinant method, you can 
show that the frequencies of the normal modes are 


= y/g< 


h + e 2 ±^l{+e 2 

£i£2 


(5.185) 


The normal modes are found to be, after some simplification, 


h(t) ' 
Ht) , 


) COs(u/-fcf + <j)±). 


(5.186) 


Some special cases are: 
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• t\ = d' 2 ' We already considered this case above. You show that eqs. (5.185) 
and (5.186) agree with eqs. (5.179) and (5.180), respectively. 

• £\ » £%: With I 2 /t\ = e, the frequencies are (to leading order in e) 



(!«)))+ = ( 2 ' ) COS< “ + ‘ + W ' 

( ^(t) ) = ( 1 ) cos (u_f+ <£_). (5.188) 


In the first mode, the masses essentially move equal distances in opposite direc¬ 
tions, at a very high frequency (because i 2 is so small). In the second mode, 
the rods form a straight line, and the masses move just like a mass of 2m. The 
system is essentially a pendulum of length l. 

• £\ -C £ 2 ' With £\/£2 = e, the frequencies are (to leading order in e) 



(«!)+ = ( _ e )°»K<+* + ). 

(«)«)_ = ( * )cos(^_( + ^_). 


(5.189) 


(5.190) 


In the first mode, the bottom mass essentially stands still, and the top mass 
oscillates at a very high frequency (because £\ is so small). The factor of 2 in 
the frequency arises because the top mass essentially lives in a world where the 
acceleration from gravity is g' = 2 g (because of the extra mg force downward 
from the lower mass). In the second mode, the system is essentially a pendulum 
of length £ 2 . The factor of 2 in the angles is what is needed to make the 
tangential force on the top mass roughly equal to zero (because otherwise it 
would oscillate at a high frequency, since £\ is so small). 

16. Pendulum with a free support 

Let x be the coordinate of M, and let 9 be the angle of the pendulum (see Fig. 5.41). 
Then the position of the mass m in cartesian coordinates is (x + £sin9,—£cos0). 
Taking the derivative to find the velocity, and then squaring to find the speed, gives 
v^ n = x 2 + £ 2 9 2 + 2£x6cos6. The Lagrangian is therefore 


L = ^Mx 2 + ^ m(x 2 + £ 2 9 2 + 2 £±6 cos 9) + mg£ cos 9. 
The equations of motion from obtained varying x and 9 are 


(M + m)x + m£0 cos 9 — m£9 2 sin 9 = 
£9 + x cos 9 + g sin 9 = 


(5.191) 



Figure 5.41 


0, 


(5.192) 
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Figure 5.42 


If 9 is small, we can use the small angle approximations, cos 9 ~ 1 — 9 2 /2 and sin 9 w 9. 
Keeping only the terms that are first-order in 9, we obtain 

(Af + m)x + mt.6 = 0, 

x + W + g9 = 0. (5.193) 


The first equation expresses momentum conservation. Integrating it twice gives 


(5.194) 


The second equation is F = ma in the tangential direction. Eliminating x from eqs. 
(5.193) gives 

#+ (M + m)| 9 = 0 (5.195) 

The solution to this equation is 9{t) = C cos (cut + 0), where 



The general solutions for 9 and x are therefore 


(5.196) 


9(t) = Ccos(cot + 4>), x(t) = — m m cos (ujt + <j>) + At+ B. (5.197) 

The constant B is irrelevant, so we’ll ignore it. The two normal modes are: 

• A = 0: In this case, x = —9ml/(M + m). Both masses oscillate with the 
frequency w given in eq. (5.196), always moving in opposite directions. The 
center of mass does not move. 

• C = 0: In this case, 9 = 0 and x = At. The pendulum hangs vertically, 
with both masses moving horizontally at the same speed. The frequency of 
oscillations is zero in this mode. 


Remarks: If M > m, then y = sjg/£, as expected, because the support essentially stays 
still. 

If m > M, then a; —> /m/M \J~gfl — > oo. This makes sense, because the tension in the 
rod is so large. We can actually be quantitative about this limit. For small oscillations and 
for m M, the tension of mg in the rod produces a sideways force of rngO on M. So the 
horizontal F = Ma equation for M is mg6 = Mx, But x « —16 in this limit, so we have 
mgO = —M£6, from which the result follows. Jit 

17. Pendulum support on an inclined plane 

Let z be the coordinate of M along the plane, and let 9 be the angle of the pendulum 
(see Fig. 5.42). In cartesian coordinates, the positions of M and m are 

(■ x,y) M = {zcos/3,-zsin/3), 

(x,y) m = (zcos/3 + £sin9, — z sin ft — £cos9). (5.198) 

Differentiating these positions, we find that the squares of the speeds are 

n, 2 _ ~2 

V M — Z ! 

v ^ = z 2 + £ 2 9 2 + 2£z9(cos/3 cos 9 - sin/3 sin 9). 


(5.199) 





5.11. SOLUTIONS 


V-51 


The Lagrangian is therefore 

^Mi 2 + ]^m(z 2 + £ 2 9 2 + 2£z9 cos(0 + /?)^ +MgzsmP + mg(zsm/3+£cos9). (5.200) 
The equations of motion obtained from varying 2 and 8 are 

(M + m)z + m£ (0cos(0 + /3)-0 2 sin(0 + /?)) = (M + m)g sin/3, 

£8 + zcos(9 + /3) = —gsinO. (5.201) 

Let us now consider small oscillations about the equilibrium point (where 9 = 8 = 0). 
We must first determine where this point is. The first equation above gives z = g sin 6. 
The second equation then gives gsin/3cos(0 + /3) = —<y sin 6*. By expanding the cosine 
term, we find tan 9 = — tan (3, so 9 = —(3. (9 = it — f3 is also a solution, but this is an 
unstable equilibrium.) The equilibrium position of the pendulum is therefore where 
the string is perpendicular to the plane. 15 

To find the normal modes and frequencies of small oscillations, let 9 = —fi + 8, and 
expand eqs. (5.201) to first order in 8. Letting g = z — g sin/? for convenience, we 
have 


(M + m)g + m£8 = 0, 

g + £8 + (g cos [3)8 = 0. (5.202) 


Using the determinant method (or using the method in the previous problem; either 
way works), the frequencies of the normal modes are found to be 


u>i = 0 , 


and 


w 2 = 


(5.203) 


These are the same as the frequencies in Problem 16 (where M moves horizontally), 
but with gcosfi in place of g . 16 (Compare eqs. (5.202) with eqs. (5.193).) Looking 
at eq. (5.197), and recalling the definition of 77 , we see that the general solutions for 
9 and z are 

9(t) = —p + C cos (ut + 4>), z(t) = ~ cos (ut + 4>) + 9 s ^^ t 2 + At + B. 

(5.204) 

The constant B is irrelevant, so we’ll ignore it. The basic difference between these 
normal modes and the ones in Problem 16 is the acceleration down the plane. If you 
go to a frame that accelerates down the plane at g sin /?, and if you tilt your head at 
an angle /? and accept the fact that g' = g cos (3 in your world, then the setup becomes 
identical to the one in Problem 16. 


ls This makes sense. Because the tension in the string is perpendicular to the plane, for all the 
pendulum bob knows, it may as well simply be sliding down a plane parallel to the given one, a 
distance l away. Given the same initial speed, the two masses will slide down their two “planes” 
with equal speeds at all times. 

16 This makes sense, because in a frame that accelerates down the plane at g sin 3. the only 
external force on the masses is an effective gravity force of g cos /3 perpendicular to the plane. As 
far as M and m are concerned, they live in a world where gravity pulls “downward” (perpendicular 
to the plane) with strength g' = g cos f3. 
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18. Tilting plane 

Relative to the support, the positions of the masses are 
(■ x,y) M = (£sin9,-£cos9), 

( x,y) m = (£sin9 + xcos9,—£cos9 + xsin9). (5.205) 

Differentiating these positions, we find that the squares of the speeds are 

v 2 f = £ 2 i9 2 , v 2 m = (£9 + x) 2 + x 2 9 2 . (5.206) 

You can also obtain v 2 n by noting that (£9 + x) is the speed along the long rod, and 
x9 is the speed perpendicular to it. The Lagrangian is 

L=^M£ 2 9 2 + ^m((£9 + x) 2 + x 2 9 2 ^+Mg£cos9 + mg(£cos9-xsm9). (5.207) 
The equations of motion obtained from varying x and 9 are 

£6 + x = x9 2 — gsinO, (5.208) 

M£ 2 6 + m£(£9 + x) + mx 2 0 + 2mxx6 = — (M + m)g£sm0 — mgxcosO. 

Let us now consider the case where both x and 6 are small (or more precisely, 9 1 

and x/£ -C 1). Expanding eqs. (5.208) to first order in 9 and x/1 gives 

(£9 + x) + g9 = 0, 

M£(£9 + g9 ) + m£(£9 + x) + mg£9 + mgx = 0. (5.209) 

We can simplify these a bit. Using the first equation to substitute — g9 for (£9 + x), 
and also — x for (£9 + g9), in the second equation gives 

£9 + x + g9 = 0, 

—Mix + mgx = 0. (5.210) 

The normal modes can be found using the determinant method, or we can find 
them just by inspection. Th e second equation says that either x(t) = 0, or x(t) = 
Acosh(af + /?), where a = \Jmg/M£. So we have two cases: 

• If x(t) = 0, then the first equation in (5.210) says that the normal mode is 

)cosM + # t (5.211) 

where ui = \fg]~£. This mode is fairly clear. With the proper initial conditions, 
m will stay right where M is. The normal force from the long rod will be exactly 
what is needed in order for m to undergo the same oscillatory motion as M. 

• If x(t) = A cosh (of + 6), then the first equation in (5.210) can be solved to give 
the normal mode, 

( x ) = C ( l(M + m) ) cosh( “‘ + <5 ' 212) 

where a = yjmg/M£. This mode is not as clear. And indeed, its range of 
validity is rather limited. The exponential behavior will quickly make x and 9 
large, and thus outside the validity of our small-variable approximations. You 
can show that in this mode the center of mass remains fixed, directly below the 
pivot. This can occur, for example, by having to move down to the right as the 
rods rotate and swing M up to the left. There is no oscillation in this mode; 
the positions keep growing. 
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19. Motion in a cone 

If the particle’s distance from the axis is r, then its height is r /tan a, and it’s distance 
up along the cone is r/ sin a. Breaking the velocity into components up along the cone 
and around the cone, we see that the square of the speed is v 2 = r 2 / sin 2 a + r 2 9 2 . 
The Lagrangian is therefore 


L = 


X m { " 2 - 
2 \ sin 2 a 


mgr 

tana 


(5.213) 


The equations of motion obtained from varying 6 and r are 
|(mr 2 0 ) = 0 

f = r6 2 sin 2 a — g cos a sin a. (5.214) 


The first of these equations expresses conservation of angular momentum. The second 
equation is more transparent if we divide through by sin a. With x = r/ sin a being 
the distance up along the cone, we have x = (r0 2 ) sin a — g cos a. This is the F = ma 
statement for the “a” direction. 

Letting mr 2 8 = L, we may eliminate 8 from the second equation to obtain 


L 2 sin 2 a 
TO 2 r 3 


g cos oi sin a. 


(5.215) 


We will now calculate the two desired frequencies. 

• Frequency of circular oscillations, u: For circular motion with r = ro, we have 
r = f = 0, so the second of eqs. (5.214) gives 


(5.216) 


> Frequency of oscillations about a circle, fi: If the orbit were actually the circle 
r = ro, then eq. (5.215) would give (with r = 0) 


2 3 = g cos a si 


(5.217) 


This is equivalent to eq. (5.216), which can be seen by writing L as rnr'^O. 

We will now use our standard procedure of letting r(t) = ro + S(t), where S(t) is 
very small, and then plugging this into eq. (5.215) and expanding to first order 
in 5. Using 


1 


1 


1 


1 


(ro + #) 3 ~ ?o + ' ir o S UK 1 + 3 6/r 0 ) " 


1 -- 


3<5\ 


(5.218) 


5 =- 5 - 5 - I-I — 5 cos a sin a. (5.219) 

m 2 rg V r 0 ) 

Recalling eq. (5.217), we obtain a bit of cancellation and are left with 
•• /3L 2 sin 2 a\ 


S. 


(5.220) 
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Using eq. (5.217) again to eliminate L we have 


Therefore, 


= \ ~ su 
V ro 


sinacosa. 


(5.221) 


(5.222) 


Having found the two desired frequencies in eqs. (5.216) and (5.222), we see that 
their ratio is 

- = V3sina. (5.223) 

This ratio 0 ,/lo is independent of ro- 

The two frequencies are equal if sin a = l/\/3, that is, if a w 35.3° = a. If a = a, then 
after one revolution around the cone, r returns to the value it had at the beginning 
of the revolution. So the particle undergoes periodic motion. 


Remarks: In the limit a —* 0, eq. (5.223) says that Q/ui — > 0. In fact, eqs. (5.216) and 
(5.222) say that u —> oo and f2 —► 0. So the particle spirals around many times during one 
complete r cycle. This seems intuitive. 

In the limit a —> 7r/2 (that is, the cone is almost a flat plane) both ui and Q go to zero, and 
eq. (5.223) says that fi/a; —> V3. This result is not at all obvious (at least to me). 

If Q/ui = -\/3sina: is a rational number, then the particle will undergo periodic motion. For 
example, if a = 60°, then Q/ui = 3/2, so it takes two complete circles for r to go through 
three cycles. Or, if a = arcsin(l/2\/3) ~ 16.8°, then Q,/u> = 1/2, so it takes two complete 
circles for r to go through one cycle. 

20. Shortest distance in a plane 

Let the two given points be (xi, y\) and (x2, yi), and let the path be described by the 
function y(x). (Yes, we’ll assume it can be written as a function. Locally, we don’t 
have to worry about any double-valued issues.) Then the length of the path is 


*=£ 


(5.224) 


The “Lagrangian” is L = y/l + y' 2 , so the Euler-Lagrange equation is 


d ( dL \ _ dL 
dx V dy') dy 



We see that y'/yjl + y' 2 is constant. Therefore, y' is also constant, so we have a 
straight line y(x) = Ax + B, where A and B are determined from the endpoint 
conditions. 


21. Index of refraction 

Let the path be described by y(x). The speed at height y is v a y. Therefore, the 
time to go from (xo,yo) to (xi,yi) is 


Jx 0 v Jx 0 



y 


dx. 


(5.226) 
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Our goal is to find the function y(x) that minimizes this integral, subject to the 
boundary conditions above. We can therefore apply the results of the variational 
technique, with a “Lagrangian” equal to 



y 


At this point, we could apply the E-L equation to this L, but let’s simply use Lemma 
5.5, with f(y) = 1 /y. Eq. (5.83) gives 

1 + y' 2 = Bf(y) 2 =► 1 + y' 2 = B 2 • (5.228) 

y 

We must now integrate this. Solving for y', and then separating variables and inte¬ 
grating, gives 


hm 



(5.229) 


Therefore, (x + A) 2 +y 2 = B, which is the equation for a circle. Note that the circle 
is centered at y = 0, that is, at a point on the bottom of the slab. This is the point 
where the perpendicular bisector of the line joining the two given points intersects 
the bottom of the slab. 


22. The Brachistochrone 

First solution: In Fig. 5.43, the boundary conditions are y( 0) = 0 and y(x o) = yo, 
with downward taken to be the positive y direction. From conservation of energy, the 
speed as a function of y is v = y/2gy. The total time is therefore 


-r d i=r- 


'Aay 


(5.230) 


Our goal is to find the function y(x) that minimizes this integral, subject to the 
boundary conditions above. We can therefore apply the results of the variational 
technique, with a “Lagrangian” equal to 


L <x 



(5.231) 


At this point, we could apply the E-L equation to this L, but let’s simply use Lemma 
5.5, with f(y) = 1/yd/. Eq. (5.83) gives 

1 + y' 2 = Cf(y) 2 =► 1 + y ,2 = ~, (5.232) 

y 


as desired. We must now integrate one more time. Solving for y 1 and separating 
variables gives 

^ dy _ = ± dx. (5.233) 


A helpful change of variables to get rid of the square root in the denominator is 
y = B sin 2 <f>. Then dy = 2B sin <j> cos <j> d<f >, and eq. (5.233) simplifies to 


2 B sin 2 <f> d<j>— ± dx. 



(x 0 ,y 0 ) 

Figure 5.43 


(5.234) 
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We can now make use of the relation sin 2 0 = (1 — cos 20)/2 to integrate this. The 
result is £?(20 — sin20) — ±2x — C, where C is an integration constant. 

Now note that we may rewrite our definition of 0 (which was y = B sin 2 0) as 2 y = 
B( 1 — cos 20). If we then define 6 = 20, we have 

x = ±a(6 — sin#) ± d, y = a(l—cos6). (5.235) 

where a = B/ 2, and d = C/2. 

The particle starts at (x,y) = (0,0). Therefore, 9 starts at 0 = 0, since this corre¬ 
sponds to y = 0. The starting condition x = 0 then implies that d = 0. Also, we are 
assuming that the wire heads down to the right, so we choose the positive sign in the 
expression for x. Therefore, we finally have 

x = a(0- sin6»), y = a(l - cos6>), (5.236) 


as desired. This is the parametrization of a cycloid, which is the path taken by a 
point on the rim of a rolling wheel. The initial slope of the y(x) curve is infinite, as 
you can check. 


Remark: The above method derived the parametric form in (5.236) from scratch. But 
since eq. (5.236) was given in the statement of the problem, another route is to simply verify 
that this parametrization satisfies eq. (5.232). To this end, assume that x = a(0 — sin#) and 
y = a(l — cos#), which gives 


, _ dy _ dy/dO _ sin# 

V = dx ~ dx/dO ~ 1-cos 6 ' 


(5.237) 


Therefore, 



which agrees with eq. (5.232), with C = 2a. ft 


(5.238) 


Second solution: Let’s use a variational argument again, but now with y as the 
independent variable. That is, let the chain be described by the function x(y). The 
arclength is now given by ds = \Jl + x' 2 dy. Therefore, instead of the Lagrangian in 
eq. (5.231), we now have 


L oc 



(5.239) 


The Euler-Lagrange equation is 


a!_ /dL\ _ &L 
dy \ dx') dx 


d !_ 

dy 


= 0 . 


(5.240) 


The zero on the right-hand side makes things nice and easy, because it means that 
the quantity in parentheses is a constant. Call it D. We then have 


1 dx/dy 

\/V \/l + {dx/dy) 2 
1 1 
\Jy if (dyikxf~TX 


(5.241) 


This is equivalent to eq. (5.232), and the solution proceeds as above. 
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Third solution: Note that the “Lagrangian” in the first solution above, which is 
given in eq. (5.231) as 



(5.242) 


is independent of x. Therefore, in analogy with conservation of energy (which arises 
from a Lagrangian that is independent of t), the quantity 


b~A l -,-l = f - 

dy y/vV 1 ~y" 2 


y/V 


(5.243) 


is a constant (that is, independent of x). We have therefore again reproduced eq. 
(5.232), and the solution proceeds as above. 

23. Minimal surface 

The tension throughout the surface is constant, because it is in equilibrium. (By 
“tension” in a surface, we mean the force per unit length in the surface.) The ratio 
of the circumferences of the circular boundaries of the ring is 2/2 /2/i - Therefore, the 
condition that the horizontal forces on the ring cancel is y-\ cos 0i = 7/2 cos O2 , where 
the 0’s are the angles of the surface, as shown in Fig. 5.44. In other words, ycos6 is 
constant throughout the surface. But cos 6 = l/^/l -b y' 2 . so we have 


= C. (5.244) 

This is the same as eq. (5.75), and the solution proceeds as in Section 5.8. 

24. Existence of a minimal surface 

The general solution for y{x) is given in eq. (5.76) as 

y(x) = ^ cosh 6 ( 2 ; + d). (5.245) 

If we choose the origin to be midway between the rings, then d = 0. Both boundary 
condition are thus 

r = \ cosh b£. (5.246) 

0 

Let us now determine the maximum value of l/r for which the minimal surface exists. 
If (,/r is too large, then we will see that there is no solution for b in eq. (5.246); in 
short, the minimal “surface” turns out to be the two given circles, attached by a line, 
which isn’t a nice two-dimensional surface. If you perform an experiment with soap 
bubbles (which want to minimize their area), and if you pull the rings too far apart, 
then the surface will break and disappear, as it tries to form the two circles. 

Define the dimensionless quantities, 



* 4 / 


Figure 5.44 


£ 

77 = -, and z = br. (5.247) 

Then eq. (5.246) becomes 

2 : = cosh 77 , 2 . (5.248) 

If we make a rough plot of the graphs of w = z and w = cosh rjz for a few values 
of 77 (see Fig. 5.45), we see that there is no solution for z if 77 is too large. The 
limiting value of 77 for which there exists a solution occurs when the curves w = z 



Figure 5.45 
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and w = cosh r)z are tangent; that is, when the slopes are equal in addition to the 
functions being equal. Let % be the limiting value of r], and let % be the place where 
the tangency occurs. Then equality of the values and the slopes gives 

zo = cosh(r/ 0 zo), and 1 = %sinh(r7oZo)- (5.249) 

Dividing the second of these equations by the first gives 

1 = {Vozo) tanh(%z 0 )- (5.250) 

This must be solved numerically. The solution is 

%z 0 w 1.200. (5.251) 

Plugging this into the second of eqs. (5.249) gives 

(-} =%« 0.663. (5.252) 


Note also that z 0 = 1.200/% = 1.810. We see that if l/r is larger than 0.663, then 
there is no solution for y(x) that is consistent with the boundary conditions. Above 
this value of i/r, the soap bubble minimizes its area by heading toward the shape of 
just two disks, but it will pop well before it reaches that configuration. 


Remarks: 


(a) We glossed over one issue above, namely that there may be more than one solution for 
the constant 6 in eq. (5.246). In fact, Fig. 5.45 shows that for any r) < 0.663, there are 
two solutions for z in eq. (5.248), and hence two solutions for b in eq. (5.246). This 
means that there are two possible surfaces that might solve our problem. Which one 
do we want? It turns out that the surface corresponding to the smaller value of b is 
the one that minimizes the area, while the surface corresponding to the larger value of 
b is the one that (in some sense) maximizes the area. 

We say “in some sense” because the large-6 surface is actually a saddle point for the 
area. It can’t be a maximum, after all, because we can always make the area larger 
by adding little wiggles to it. It’s a saddle point because there does exist a class of 
variations for which it has the maximum area, namely ones where the “dip” in the 
curve is continuously made larger (just imagine lowering the midpoint in a smooth 
manner). This surface arises because the Euler-Lagrange technique simply sets the 
“derivative” equal to zero and doesn’t differentiate between maxima, minima, and 
saddle points. 

(b) How does the area of the limiting surface (with % = 0.663) compare with the area of 
the two circles? The area of the two circles is 

A c = 2nr 2 . (5.253) 


The area of the limiting surface is 


Using eq. (5.246), this becomes 
A s = 


= J 2-Kyy/l + y' 2 dx. 

J cosh 2 bx dx 

/> + cosh 2bx) dx 


(5.254) 


■ sinh 2b£ 


(5.255) 
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But from the definitions of 77 and z, we have I = rjor and b = zo/r for the limiting 
surface. Therefore, A s can be written as 

A s = tU(^ + S ' mh ^ 0Z ° y (5.256) 

Plugging in the numerical values (t?o « 0.663 and zq « 1.810) gives 

A c ss (6.28)r 2 , and A s * (7.54)r 2 . (5.257) 

The ratio of ^4 S to A c is approximately 1.2 (it’s actually rjozo, as you can show). The 
limiting surface therefore has a larger area. This is expected, of course, because for 
I/r > r) 0 the surface tries to run off to one with a smaller area, and there are no other 
stable configurations besides the cosh solution we found. 
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Chapter 6 

Central Forces 

Copyright 2004 by David Morin, morin@physics.harvard.edu 


A central force is by definition a force that points radially and whose magnitude 
depends only on the distance from the source (that is, not on the angle around 
the source). 1 Equivalently, we may say that a central force is one whose potential 
depends only on the distance from the source. That is, if the source is located at 
the origin, then the potential energy is of the form V{r) = V(r ). Such a potential 
does indeed yield a central force, because 


F(r) = —VV(r) = —-f, (6.1) 

dr 

which points radially and depends only on r. Gravitational and electrostatic forces 
are central forces, with V(r) oc 1/r. The spring force is also central, with V(r) oc 
(r — t) 2 , where £ is the equilibrium length. 

There are two important facts concerning central forces: (1) they are ubiquitous 
in nature, so we had better learn how to deal with them, and (2) dealing with 
them is much easier than you might think, because crucial simplifications occur in 
the equations of motion when V is a function of r only. These simplifications will 
become evident in the following two sections. 

6.1 Conservation of angular momentum 

Angular momentum plays a key role in dealing with central forces because, as we 
will show, it is constant in time. For a point mass, we define the angular momentum , 
L, by 

L = r x p. (6.2) 

The vector L depends, of course, on where you pick the origin of your coordinate 
system. Note that L is a vector, and that it is orthogonal to both r and p, by nature 
of the cross product. You might wonder why we care enough about r x p to give it 

1 Taken literally, the term “central force” would imply only the radial nature of the force. But a 
physicist’s definition also includes the dependence solely on the distance from the source. 


VI-1 






VI-2 


CHAPTER 6. CENTRAL FORCES 


a name. Why not look at r 3 p 5 r x (r x p), or something else? The answer is that 
there are some very nice facts concerning L, one of which is the following. 2 


Theorem 6.1 If a particle is subject to a central force only, then its angular mo¬ 
mentum is conserved. That is, 


Proof: We have 


If V(r) = V(r), then — = 0. 


dL 

dt 


d . 

* (rxp) 

dr 

* xp+r> 

v x (mv) + i 


dp 

dt 

x F 


0 , 


(6.3) 


(6.4) 


because Focr, and the cross product of two parallel vectors is zero. ■ 

We will prove this theorem again in the next section, using the Lagrangian 
method. Let’s now prove another theorem which is probably obvious, but good to 
show anyway. 

Theorem 6.2 If a particle is subject to a central force only, then its motion takes 
place in a plane. 

Proof: At a given instant, to, consider the plane, P, containing the position vector 
ro (with the source of the potential taken to be the origin) and the velocity vector 
vo. We claim that r lies in P at all times. 3 

P is defined as the plane orthogonal to the vector no e ro x vo- But in the proof 
of Theorem 6.1, we showed that the vector rxv = (rxp)/m does not change with 
time. Therefore, r x v = no for all t. Since r is certainly orthogonal to r x v, we 
see that r is orthogonal to no for all t. Hence, r must lie in P. ■ 

An intuitive look at this theorem is the following. Since the position, speed, 
and acceleration (which is proportional to F, which in turn is proportional to the 
position vector, r) vectors initially all lie in P, there is a symmetry between the two 
sides of P. Therefore, there is no reason for the particle to head out of P on one 
side rather than the other. The particle therefore remains in P. We can then use 
this same reasoning again a short time later, and so on. 

This theorem shows that we need only two coordinates, instead of the usual 
three, to describe the motion. But since we’re on a roll, why stop there? We 
will show below that we really only need one variable. Not bad, three coordinates 
reduced down to one. 

2 This is a special case of the fact that torque equals the rate of change of angular momentum. 
We’ll talk about this in great detail in Chapter 7. 

3 The plane P is not well-defined if vo = 0, or ro = 0, or vo is parallel to ro. But in these cases, 
you can easily show that the motion is always radial, which is even more restrictive than planar. 
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6.2 The effective potential 

The effective potential provides a sneaky and useful method for simplifying a 3- 
dimensional central-force problem down to a 1-dimensional problem. Let’s see how 
it works. 

Consider a particle of mass m subject to a central force only, described by the 
potential V(r). Let r and 6 be the polar coordinates in the plane of the motion. In 
these polar coordinates, the Lagrangian (which we’ll label as to save “L” for 
the angular momentum) is 


£ = I m {r 2 + r 2 9 2 ) - V{r). (6.5) 

The equations of motion obtained from varying r and 9 are 
mir = mrO 2 — V'(r), 

mr 2 0 ) = 0. (6.6) 

at 

The first equation is the force equation along the radial direction, complete with 
the centripetal acceleration, in agreement with the first of eqs. (2.52). The second 
equation is the statement of conservation of angular momentum, because mr 2 0 = 
r(mr0) = rpe (where po is the magnitude of the momentum in the angular direction), 
which is the magnitude of L = r x p. We therefore see that the magnitude of L is 
constant. And since the direction of L is always perpendicular to the fixed plane of 
the motion, the vector L is constant in time. We have therefore just given a second 
proof of Theorem 6.1. In the present Lagrangian language, the conservation of L 
follows from the fact that 9 is a cyclic coordinate, as we saw in Example 2 in Section 
5.5.1. 

Since mr 2 9 does not change in time, let us denote its constant value by 

L = mr 2 9. (6.7) 

L is determined by the initial conditions; it could be specified, for example, by giving 
the initial values of r and 9. Using 9 = L/(mr 2 ), we may eliminate 9 from the first 
of eqs. (6.6). The result is 

L 2 

mr = -.. — V'(r). (6.8) 

mr 6 

Multiplying by r and integrating with respect to time yields 

(6 - 9) 

where E is a constant of integration. E is simply the energy, which can be seen by 
noting that this equation could also have been obtained by simply using eq. (6.7) 
to eliminate 9 in the energy equation, (rn/2)( f 2 + r 2 9 2 ) + V(r) = E. 
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Eq. (6.9) is rather interesting. It involves only the variable r. And it looks a lot 
like the equation for a particle moving in one dimension (labeled by the coordinate 
r) under the influence of the potential 


Ee ff (r) = 


2 mr 2 


bV(r) 


( 6 . 10 ) 


The subscript “eff” here stands for “effective”. V e g(r) is called the effective potential. 
The “effective force” is easily read off from eq. (6.8) to be 



Figure 6.1 


Eeff(r-) = ^3 - V'{r), (6.11) 

mr" 3 

which agrees with F e g = — lV ff (r), as it should. 

This “effective” potential concept is a marvelous result and should be duly ap¬ 
preciated. It says that if we want to solve a two-dimensional problem (which could 
have come from a three-dimensional problem) involving a central force, we can re¬ 
cast the problem into a simple one-dimensional problem with a slightly modified 
potential. We can forget that we ever had the variable 9, and we can solve this 
one-dimensional problem (as we’ll demonstrate below) to obtain r(t). Having found 
r(t), we can use 9(t) = L/mr 2 to solve for 9(t) (in theory, at least). 

Note that this whole procedure works only because there is a quantity involv¬ 
ing r and 6 that is independent of time. The variables r and 9 are therefore not 
independent, so the problem is really one-dimensional instead of two-dimensional. 

To get a general idea of how r behaves with time, we simply have to graph 
T4ff(f)- Consider the example where V(r) = Ar 2 . This is the potential for a spring 
with equilibrium length zero. Then 


^eff(r) = + Ar 2 . (6.12) 

To graph 14ff( r )> we must be given L and A. But the general shape looks like the 
curve in Fig. 6.1. The energy E (which must be given, too) is also drawn. The 
coordinate r will bounce back and forth between the turning points, r\ and r 2 , which 
satisfy = A. 4 If E equals the minimum of V e ff(r). then rq = r 2 , so r is 

stuck at this one value, which means that the motion is a circle. Note that it is 
impossible for E to be less than the minimum of V e g. 

Remark: The L 2 /2mr 2 term in the effective potential is sometimes called the angular 
momentum barrier. It has the effect of keeping the particle from getting too close to the 
origin. Basically, the point is that L = mr 2 9 is constant, so as r gets smaller, 6 gets bigger. 
But 6 increases at a greater rate than r decreases, due to the square of the r in L = mr 2 9. 
So eventually we end up with a tangential kinetic energy, mr 2 6 2 / 2, that is greater than 
what is allowed by conservation of energy. 5 

4 It turns out that for our Ar 2 spring potential, the motion in space is an ellipse, with semi-axis 
lengths n and r 2 (see Problem 5). But for a general potential, the motion isn’t so nice. 

5 If V(r) goes to —oo faster than — 1/r 2 , then this argument doesn’t hold. You can see this by 
drawing the graph of Vhf(r), which heads to —oo instead of +oo as r —> 0. V(r) decreases fast 
enough to compensate for the increase in kinetic energy. 
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As he walked past the beautiful belle, 

The attraction was easy to tell. 

But despite his persistence, 

He was kept at a distance 
By that darn conservation of L. £ 

Note that it is by no means necessary to introduce the concept of the effective 
potential. You can simply solve the equations of motion, eqs. (6.6), as they are. 
But introducing V e R makes it much easier to see what’s going on in a central-force 
problem. 


When using potentials, effective, 

Remember the one main objective: 

The goal is to shun 
All dimensions but one, 

And then view things with 1-D perspective. 


6.3 Solving the equations of motion 

If we want to be quantitative, we must solve the equations of motion, eqs. (6.6). 
Equivalently, we must solve their integrated forms, eqs. (6.7) and (6.9), which are 
simply the conservation of L and E statements, 

mr 2 9 = L, 

^ mf 2 + 2^2 +F(r) = E ' (6 ' 13) 

The word “solve” is a little ambiguous here, because we should specify what 
quantities we want to solve for in terms of what other quantities. There are essen¬ 
tially two things we can do. We can solve for r and 6 in terms of t. Or, we can solve 
for r in terms of 9. The former has the advantage of immediately yielding velocities 

and, of course, the information of where the particle is at time t. The latter has the 

advantage of explicitly showing what the trajectory looks like in space, even though 
we don’t know how quickly it is being traversed. We will deal mainly with this latter 
case, particularly when we discuss the gravitational force and Kepler’s Laws below. 
But let’s look at both procedures now. 

6.3.1 Finding r(t) and 9(t) 

The value of r at any point is found from eq. (6.13) to be 



To get an actual r(t ) out of this, we must be supplied with E and L (which may 
be found using the initial values of r, r, and 9). and also the function V(r). To 
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solve this differential equation, we “simply” have to separate variables and then (in 
theory) integrate: 

f . ^ = = ± fdt = (t — to). (6.15) 

We must perform this (rather unpleasant) integral on the left-hand side, to obtain 
t as a function of r. Having found t(r), we may then (in theory) invert the result 
to obtain r(f). Finally, substituting this r(t) into the relation 9 = L/mr 2 from eq. 
(6.13), we have 9 as a function of t. which we can (in theory) integrate to obtain 
*(*)■ 

The bad news about this procedure is that for most V(r)’s the integral in eq. 
(6.15) is not calculable in closed form. There are only a few “nice” potentials V(r) 
for which we can evaluate it. And even then, the procedure is a pain. 6 But the good 
news is that these “nice” potentials are precisely the ones we are most interested 
in. In particular, the gravitational potential, which goes like 1 /r and which we will 
spend most of our time with in the remainder of this chapter, leads to a calculable 
integral (the spring potential ~ r 2 does also). But never mind; we’re not going to 
apply this procedure to gravity. It’s nice to know that the procedure exists, but we 
won’t be doing anything else with it. Instead, we’ll use the following strategy. 

6.3.2 Finding r(9) 

We may eliminate the dt from eqs. (6.13) by getting the r 2 term alone on the left 
side of the second equation, and then dividing by the square of the first equation. 
The dt 2 factors cancel, and we obtain 

/ 1 dr\ 2 2 mE 1 2 mV(r) 

='£» &-■ (6 ' 16) 
At this point, we can (in theory) take a square root, separate variables, and then 
integrate to obtain 9 as a function of r. We can then (in theory) invert to obtain r 
as a function of 9. To do this, of course, we must be given the function V(r). So 
let’s now finally give ourselves a V(r) and do a problem all the way through. We’ll 
study the most important potential of all (or perhaps the second most important 
one), gravity. 7 

6.4 Gravity, Kepler’s Laws 

6.4.1 Calculation of r{9) 

Our goal in this subsection will be to obtain r as a function of 9, for a gravitational 
potential. Let’s assume that we’re dealing with the earth and the sun, with masses 

6 You can, of course, always evaluate the integral numerically. See Appendix D for a discussion 
of this. 

7 The two most important potentials in physics are certainly the gravitational and harmonic- 
oscillator ones. Interestingly, they both lead to doable integrals, and they both lead to elliptical 
orbits. 
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Mq and to, respectively. The gravitational potential energy of the earth-sun system 

V(r) — — ° . where a = GM Q m. (6.17) 

r 

In the present treatment, let us consider the sun to be bolted down at the origin of 
our coordinate system. Since M 0 to, this is approximately true for the earth-sun 
system. 8 Eq. (6.16) becomes 


/ 1 dr\ 2 2 mE 1 2m.a 

\r 2 dOJ I 2 ” r 2 + rl? ' 


(6.18) 


As stated above, we could take a square root, separate variables, integrate to find 
9(r), and then invert to find r(9). This method, although straightforward, is terribly 
messy. Let’s solve for r(9) in a slick way. 

With all the 1/r terms floating around, it might be easier to solve for 1/r instead 
of r. Using d(l/r)/d9 = —(dr/dO) jr 1 , and letting y = 1/r for convenience, eq. (6.18) 


becomes 


' dy\ 2 2 2 ma 2 mE 

.m) = -r + ^t» + -W 


(6.19) 


At this point, we could also use the separation-of-variables technique, but let’s 
continue to be slick. Completing the square on the right-hand side, we obtain 


/ dy\ 2 ( ma\ 2 2 mE 

(' de) =~{ y ~ u) 


( ma\ 2 

W • 


( 6 . 20 ) 


Defining z = y — ma/L 2 for convenience, we have 


2 



( 6 . 21 ) 


At this point, in the spirit of being slick, we can just look at this equation and 
observe that 

z = Bcos(9-9 0 ) (6.22) 

is the solution, because cos 2 x + sin 2 x = 1. 


Remark: Lest we feel guilty about not doing separation-of-variables at least once in 
this problem, let’s solve eq. (6.21) that way, too. The integral is nice and doable, and we 
have 



■z = Bcos ^(0 — 9i) + cos 1 

= Bcos(9 — 6> 0 ). * (6.23) 


8 If we want to do the problem exactly, we must use the reduced mass. This topic is discussed in 
Section 6.4.5. 
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It is customary to pick the axes so that 9o = 0, so we’ll drop the 9q from here 
on. Recalling our definition z = 1/r — ma/L 2 and also the definition of B from eq. 
(6.21), eq. (6.22) becomes 

^ = "^(l + ecosfl), (6.24) 

where 



is the eccentricity of the particle’s motion. We will see shortly exactly what e 
signifies. 

This completes the derivation of r(9) for the gravitational potential, V(r) oc 1/r. 
It was a little messy, but not unbearably painful. At any rate, we just discovered the 
basic motion of objects under the influence of gravity, which takes care of virtually 
all of the gazillion tons of stuff in the universe. Not bad for one page of work. 

Newton said as he gazed off afar, 

“From here to the most distant star, 

These wond’rous ellipses 

And solar eclipses 

All come from a 1 over r.” 


What are the limits on r in eq. (6.24)? The minimum value of r is obtained when 
the right-hand side reaches its maximum value, which is (ma/L 2 )( 1 + e). Therefore, 


L 2 

ma( 1 + e) 


(6.26) 


What is the maximum value of r? The answer depends on whether e is greater 
than or less than 1. If e < 1 (which corresponds to circular or elliptical orbits, as 
we will see below), then the minimum value of the right-hand side of eq. (6.24) is 
(ma/L 2 )(l — e). Therefore, 


L 2 

W = -77-7 (if e< 1). (6.27) 

ma( 1 - e) 

If e > 1 (which corresponds to parabolic or hyperbolic orbits, as we will see be¬ 
low), then the right-hand side of eq. (6.24) can become zero (when cos 9 = — 1/e). 
Therefore, 

r m ax = oo (if e > 1). (6.28) 


6.4.2 The orbits 


Let’s examine in detail the various cases for e. 
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• Circle (e = 0) 

If e = 0, then eq. (6.25) says that E = —ma 2 /2L 2 . The negative E simply 
means that the potential energy is more negative than the kinetic energy is 
positive. The particle is trapped in the potential well. Eqs. (6.26) and (6.27) 
give r m i n = r max = L 2 /ma. Therefore, the particle moves in a circular orbit 
with radius L 2 /mot. Equivalently, eq. (6.24) says that r is independent of 9. 
Note that it isn’t necessary to do all the work of Section 6.4.1 if we just want to 
look at circular motion. For a given L, the energy —ma 2 /2L 2 is the minimum 
value that the E given by eq. (6.13) can take. (To achieve the minimum, 
we certainly want r = 0. And you can show that minimizing the effective 
potential, L 2 /2mr 2 — a/r, yields this value for E.) If we plot V e ^(r), we have 
the situation shown in Fig. 6.2. The particle is trapped at the bottom of the 
potential well, so it has no motion in the r direction. 


jr . \ L a 



Figure 6.2 


• Ellipse (0 < e < 1) 

If 0 < e < 1, then eq. (6.25) says that —ma 2 /2L 2 < E < 0. Eqs. (6.26) and 
(6.27) give r m i n and r max . It is not obvious that the resulting motion is an 
ellipse. We will demonstrate this below. 

If we plot V eff (r), we have the situation shown in Fig. 6.3. The particle 
oscillates between r m i n and r max . The energy is negative, so the particle is 
trapped in the potential well. 

• Parabola (e = 1) 

If e = 1, then eq. (6.25) says that E = 0. This value of E implies that the 
particle barely makes it out to infinity (its speed approaches zero as r —> oo). 
Eq. (6.26) gives r min = L 2 /2ma, and eq. (6.28) gives r max = oo. Again, it is 
not obvious that the resulting motion is a parabola. We will demonstrate this 
below. 

If we plot Feff(r), we have the situation shown in Fig. 6.4. The particle does 
not oscillate back and forth in the r-direction. It moves inward (or possibly 
not, if it was initially moving outward), turns around at r m i n = L 2 /2ma, and 
then heads out to infinity forever. 

• Hyperbola (e > 1) 

If e > 1, then eq. (6.25) says that E > 0. This value of E implies that the 
particle makes it out to infinity with energy to spare. (The potential goes to 
zero as r —> oo, so the particle’s speed approaches the nonzero value y/2E/m 
as r -> oo.) Eq. (6.26) gives r min . and eq. (6.28) gives r max = oo. Again, it 
is not obvious that the resulting motion is a hyperbola. We will demonstrate 
this below. 

If we plot I4ff( r )> we have the situation shown in Fig. 6.5. As in the parabola 
case, the particle does not oscillate back and forth in the r-direction. It moves 
inward (or possibly not, if it was initially moving outward), turns around at 
r min . and then heads out to infinity forever. 





Figure 6.3 


KffV) 



Figure 6.4 


Kff ( r ) 



Figure 6.5 
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y 



Figure 6.6 



Figure 6.7 


y 



6.4.3 Proof of conic orbits 


Let’s now prove that eq. (6.24) does indeed describe the conic sections stated above. 
We will also show that the origin (the source of the potential) is a focus of the conic 
section. These proofs are straightforward, although the ellipse and hyperbola cases 
get a bit messy. 

In what follows, we will find it easier to work with cartesian coordinates. For 
convenience, let 



ma 


(6.29) 


Multiplying eq. (6.24) through by kr, and using cos 0 = x/r, gives 


k = r + ex. (6.30) 

Solving for r and squaring yields 

x 2 + y 2 = k 2 — 2 kex + e 2 x 2 . (6.31) 

Let’s look at the various cases for e. We will invoke without proof various facts 
about conic sections (focal lengths, etc.). 


• Circle (e = 0) 

In this case, eq. (6.31) becomes x 2 + y 2 = k 2 . So we have a circle of radius 
k = L 2 /ma, with its center at the origin (see Fig. 6.6). 

• Ellipse (0 < e < 1) 

In this case, eq. (6.31) may be written as (after completing the square for the 
x terms, and expending some effort) 

(x + v 2 k k 

- - q 2 6 7 ^ = where a=-— f2 ■ and b = — = . (6.32) 

This is the equation for an ellipse with its center located at (—A;e/(1 — e 2 ),0). 
The semi-major and semi-minor axes a and b, respectively, and the focal length 
is c = Va 2 ~b 2 = ke/( 1 — e 2 ). Therefore, one focus is located at the origin 
(see Fig. 6.7). Note that c/a equals the eccentricity, e. 

• Parabola (e = 1) 

In this case, eq. (6.31) becomes y 2 = k 2 — 2 kx. This may be written as 
y 2 = —2 k(x — |). This is the equation for a parabola with vertex at (k/ 2,0) 
and focal length k/2. (The focal length of a parabola written in the form 
y 2 = 4 ax is a.) So we have a parabola with its focus located at the origin (see 
Fig. 6.8). 


Figure 6.8 
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• Hyperbola (e > 1) 

In this case, eq. (6.31) may be written (after completing the square for the x 
terms) 

(x - -tzt) v 2 k k 

- -^— - 7 ^ = 1, where a=^-, and b =— F =^= . (6.33) 

a 2 b 2 e 2 - 1 ' - 1 v ’ 

This is the equation for a hyperbola with its center (defined to be the in¬ 
tersection of the asymptotes) located at ( ke/(e 2 — 1),0). The focal length is 
c = y/a 2 + b 2 = ke/(e 2 — 1). Therefore, the focus is located at the origin (see 
Fig. 6.9). Note that c/a equals the eccentricity, e. 

The impact parameter (usually denoted by the letter b) of a trajectory is 
defined to be the closest distance to the origin the particle would achieve if it 
moved in the straight line determined by its initial velocity (that is, along the 
dotted line in the Fig. 6.9). You might think that choosing the letter b here 
would cause a problem, because we already defined b in eq. (6.33). However, 
it turns out that these two definitions are identical (see Exercise 6), so all is 
well. 



Figure 6.9 


Remark: Eq. (6.33) actually describes an entire hyperbola, that is, it also describes 
a branch that opens up to the right. However, this right branch was introduced in 
the squaring operation that produced eq. (6.31). It is not a solution to the original 
equation we wanted to solve, eq. (6.30). What makes the left branch, and not the right 
branch, the relevant one? The left-right symmetry was broken when we arbitrarily 
chose a positive value for B in eq. (6.21), or equivalently, a positive value for e in 
eq. (6.25). If we had chosen B and e to be negative, then the hyperbola would be 
centered at a negative value of x and would open up to the right, as you can check. 
The result would simply be Fig. 6.9, reflected across the t/-axis. 

It turns out that the right-opening branch (or its reflection in the y- axis, depending 
on your choice of sign for e) is relevant in a certain physical situation; see Exercise 9. 

* 


6.4.4 Kepler’s Laws 

We can now, with minimal extra work, write down Kepler’s Laws. Kepler (1571- 
1630) lived prior to Newton (1642-1727). Kepler arrived at these laws via obser¬ 
vational data, which was a rather impressive feat. It was known since the time of 
Copernicus (1473-1543) that the planets move around the sun, but it was Kepler 
and Newton who first gave a quantitative description of the orbits. 

Kepler’s laws assume that the sun is massive enough so that its position is 
essentially fixed in space. This is a very good approximation, but the following 
section on reduced mass will show how to modify them and solve things exactly. 

• First Law: The planets move in elliptical orbits with the sun at one focus. 
We proved this in eq. (6.32). Of course, there are undoubtedly objects flying 
past the sun in hyperbolic orbits. But we don’t call these things planets, 
because we never see the same one twice. 
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Figure 6.10 


• Second Law: The radius vector to a planet sweeps out area at a rate that is 
independent of its position in the orbit. 

This law is nothing other than the statement of conservation of angular mo¬ 
mentum. The area swept out by the radius vector during a short period of time 
is cL4 = r{rd9)/ 2, because rdO is the base of the thin triangle in Fig. 6.10. 
Therefore, we have (using L = mr 2 Q) 


dA _ 1 _ L_ 

dt 2 V 2m ’ 

which is constant, because L is constant for a central force. 


(6.34) 


• Third Law: The square of the period of an orbit, T, is proportional to the 
cube of the semimajor-axis length, a. More precisely, 


2 47r 2 mo 3 _ 4-7r 2 a 3 

= a = GM q ’ 


(6.35) 


where M 0 is the mass of the sun. Note that the planet’s mass, m, does not 
appear in this equation. 

Proof: Integrating eq. (6.34) over the time of a whole orbit gives 


A 


LT 

2m 


(6.36) 


But the area of an ellipse is A = nab, where a and b are the semi-major and 
semi-minor axes, respectively. Squaring (6.36) and using eq. (6.32) to write 
b = a\f l — e 2 gives 


/ L 2 \ T 2 
(m(l — e 2 ) J Am ’ 


(6.37) 


We have grouped the right-hand side in this way because we may now use the 
L 2 = mak relation from eq. (6.29) to transform the term in parentheses into 
ak/( 1 — e 2 ) = aa, where a is given in eq. (6.32). But aa = ( GM Q m)a , so we 


obtain 


2 4 (GM Q ma)T 2 

7r a = --- 

4m 


(6.38) 


which gives eq. (6.35), as desired. 


These three laws describe the motion of all the planets (and asteroids, comets, 
and such) in the solar system. But our solar system is only the tip of the iceberg. 
There’s a lot more stuff out there, and it’s all governed by gravity (although New¬ 
ton’s inverse square law must be supplanted by Einstein’s General Relativity theory 
of gravitation). There’s a whole universe around us, and with each generation we 
can see and understand a little more of it, both experimentally and theoretically. In 
recent years, we’ve even begun to look for friends we might have out there. Why? 
Because we can. There’s nothing wrong with looking under the lamppost now and 
then. It just happens to be a very big one in this case. 
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As we grow up, we open an ear, 
Exploring the cosmic frontier. 

In this coming of age, 

We turn in our cage, 

All alone on a tiny blue sphere. 


6.4.5 Reduced mass 


We assumed in Section 6.4.1 that the sun is large enough so that it is only negligibly 
affected by the presence of the planets. That is, it is essentially fixed at the origin. 
But how do we solve a problem in which the masses of the two interacting bodies are 
comparable in size? Equivalently, how do we solve the earth-sun problem exactly? It 
turns out that the only modification required is a simple replacement of the earth’s 
mass with the reduced mass, defined below. The following discussion actually holds 
for any central force, not just gravity. 

The Lagrangian of a general central-force system consisting of the interacting 
masses mi and m 2 is 

T = I mi r? +^m 2 r| - V(|r!-r 2 |). (6.39) 

We have written the potential in this form, dependent only on the distance |ri — r 2 |, 
because we are assuming a central force. Let us define 


miTi + m 2 r 2 
mi + m 2 


and r = ri — r 2 . 


(6.40) 


R and r are simply the position of the center of mass and the vector between the 
masses, respectively. Invert these equations to obtain 


ri = R+-^r, and r 2 = R - -^r, (6-41) 

where M = m\ + m 2 is the total mass of the system. In terms of R and r, the 
Lagrangian becomes 


C 


Im^R+^r) +Im 2 (R--^) -V(|r 
^MR 2 + ^r 2 -V(r), 


where the reduced mass, n, is defined by 


1 


1 

-h — 


mi m 2 


(6.42) 


(6.43) 


We now note that the Lagrangian in eq. (6.42) depends on R, but not on R. 
Therefore, the Euler-Lagrange equations say that R is constant. That is, the CM 
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moves at constant velocity (this is just the statement that there are no external 
forces). The CM motion is therefore trivial, so let’s ignore it. Our Lagrangian 
therefore essentially becomes 

C -4; ^pr 2 — V(r). (6.44) 

But this is simply the Lagrangian for a particle of mass p which moves around a 
fixed origin under the influence of the potential V(r). 

For gravity, we have 

C = ^pr 2 + — (where a = GM Q m). (6.45) 


To solve the earth-sun system exactly, we therefore simply need to replace (in the 
calculation in Section 6.4.1) the earth’s mass, m, with the reduced mass, p, given 
by 


(6.46) 


1 _ 1 1 

p m M 0 

The resulting value of r in eq. (6.24) is the distance between the earth and sun. The 
earth and sun are therefore distances of (M e /M)r and ( m/M)r , respectively, away 
from the CM, from eq. (6.41). These distances are simply scaled-down versions of 
the distance r, which represents and ellipse, so we see that the earth and sun move 
in elliptical orbits (whose sizes are in the ratio M Q /m) with the CM as a focus. 
Note that the m 's that are buried in the L and e in eq. (6.24) must be changed to 
p’s. But a is still defined to be GM & m, so the m in this definition does not get 
replaced with p. 

For the earth-sun system, the p in eq. (6.46) is essentially equal to m, because 
Mq is so large. Using m = 5.98 • 10 24 kg, and M© = 1.99 • 10 3O kg, we find that p is 
smaller than m by only one part in 3 • 10 5 . Our fixed-sun approximation is therefore 
a very good one. You can show that the CM is 5 • 10 5 m from the center of the sun, 
which is well within the sun (about a thousandth of the radius). 

How are Kepler’s laws modified when we solve for the orbits exactly using the 
reduced mass? 


• First Law: The elliptical statement in the first law is still true, but with the 
CM (not the sun) located at a focus. The sun also travels in an ellipse with 
the CM at a focus. 9 Whatever is true for the earth must also be true for the 
sun, because they come into eq. (6.42) symmetrically. The only difference is 
in the size of various quantities. 

• Second Law: In the second law, we need to consider the position vector from 
the CM (not the sun) to the planet. This vector sweeps out equal areas in 
equal times, because the angular momentum of the earth (and the sun, too) 
relative to the CM is fixed. This is true because the gravitational force always 

9 Well, this statement is true only if there is just one planet. With many planets, the tiny motion 
of the sun is very complicated. This is perhaps the best reason to work in the approximation where 
it is essentially bolted down. 
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points through the CM, so the force is a central force with the CM as the 
origin. 

• Third Law: Eq. (6.45) describes a particle of mass /i moving in a potential 
of —a/r. The reasoning we used in obtaining eq. (6.35) still holds, provided 
that we change all the m’s to p’s, except the one in a e GM Q m. In other 
words, we still arrive at eq. (6.38), except with the bottom m (but not the 
top one) replaced with //. Therefore, we obtain 


47r 2 a 3 /x _ 47r 2 a 3 

GM Q m G(M q + m) ’ 


(6.47) 


where we have used n = M 0 m/(M 0 + m). Eq. (6.47) reduces to eq. (6.35) 
when n « m (that is, when M 0 S> to) , as it should. Note the symmetry 
between M e and m. 

The T in eq. (6.47) is the time for the hypothetical particle of mass /j to 
complete an orbit. But this is the same as the time for the earth (and the 
sun) to complete an orbit. So it is indeed the time we are looking for. The a in 
eq. (6.47) is the semi-major axis of the hypothetical particle’s orbit. In other 
words, it is half of the maximum distance between the earth and the sun. 
If you want to write the third law using the semi-major axis of the earth’s 
elliptical orbit, which is a e = ( M Q /M)a , then simply plug a = (M/M 0 )a e 
into eq. (6.47). 
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6.5 Exercises 

Section 6.1: Conservation of angular momentum 

1. Wrapping around a pole * 

A puck of mass m on frictionless ice is attached by a horizontal string of 
length t to a very thin vertical pole of radius R. The puck is given a kick 
and circles around the pole with initial speed vo■ The string wraps around the 
pole, and the puck gets drawn in and eventually hits the pole. What quantity 
is conserved during the motion? What is the puck’s speed right before it hits 
the pole? 

Section 6.2: The effective potential 

2. Power-law spiral ** 

Given L, find the form of V (r) so that the path of a particle is given by the 
spiral r = C6 k , where C and k are constants. Hint: Obtain an expression for 
r that contains no 0’s, and then use eq. (6.9). 

Section 6 . 4 .: Gravity, Kepler’s Laws 

3. Circular orbit * 

For a circular orbit, derive Kepler’s third law from scratch, using F = ma. 

4. Falling into the sun * 

Imagine that the earth is suddenly (and tragically) stopped in its orbit, and 
then allowed to fall radially into the sun. How long will this take? Use data 
from Appendix J. Hint: Consider the radially path to be half of a very thin 
ellipse. 

5. Closest approach ** 

A particle with speed no and impact parameter b starts far away from a planet 
of mass M. 

(a) Starting from scratch (that is, without using any of the results from 
Section 6.4), find the distance of closest approach to the planet. 

(b) Use the results of the hyperbola discussion in Section 6.4.3 to show that 
the distance of closest approach to the planet is k/(e+ 1), and then show 
that this agrees with your answer to part (a). 

6. Impact parameter ** 

Show that the distance b defined in eq. (6.33) and Fig. 6.9 is equal to the 
impact parameter. Do this: 

(a) Geometrically, by showing that b is the distance from the origin to the 
dotted line in Fig. 6.9. 
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(b) Analytically, by letting the particle come in from infinity at speed no and 
impact parameter b and then showing that the b in eq. (6.33) equals b'. 

7. Skimming a planet ** 

A particle travels in a parabolic orbit in a planet’s gravitational field and skims 
the surface at its closest approach. The planet has mass density p. Relative 
to the center of the planet, what is the angular velocity of the particle as it 
skims the surface? 

8. Parabola L ** 

Consider a parabolic orbit of the form y = x 2 f (4T), which has focal length 
I. Let the speed at closest approach be vo- The angular momentum is then 
mvoi. Show explicitly (by finding the speed and the “lever arm”) that this is 
also the angular momentum when the particle is very far from the origin (as 
it must be, because L is conserved). 

9. Repulsive potential ** 

Consider an “anti-gravitational” potential (or more mundanely, the electro¬ 
static potential between two like charges), 

V(r) = — , where a > 0. (6.48) 

What is the basic change in the analysis of Section 6.4.3? Draw the figure 
analogous to Fig. 6.9 for the hyperbolic orbit. Show that circular, elliptical, 
and parabolic orbits do not exist. 

10. Ellipse axes ** 

Taking it as given that eq. (6.24) describes as ellipse for 0 < e < 1, calculate 
the lengths of the semi-major and semi-minor axes, and show that your results 
agree with eq. (6.32). 

11. Zero potential ** 

A particle is subject to a constant potential, which we will take to be zero. Fol¬ 
lowing the general strategy in Sections 6.4.1 and 6.4.3, show that the particle’s 
path is a straight line. 
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6.6 Problems 

Section 6.2: The effective potential 

1. Maximum L *** 

A particle moves in a potential V(r) = — Voe~ x2r2 . 

(a) Given L, find the radius of the stable circular orbit. An implicit equation 
is fine here. 

(b) It turns out that if L is too large, then a circular orbit actually doesn’t 
exist. What is the largest value of L for which a circular orbit does indeed 
exist? What is the value of V^jj(r) in this case? 

2. Cross section ** 

A particle moves in a potential V(r) = —(7/(3r 3 ). 

(a) Given L, find the maximum value of the effective potential. 

(b) Let the particle come in from infinity with speed no and impact parameter 
b. In terms of C, m, and no, what is the largest value of b (call it 6 max ) 
for which the particle is captured by the potential? In other words, what 
is the “cross section” for capture, nb^^, for this potential? 

3. Exponential spiral ** 

Given L, find the form of V (r) so that the path of a particle is given by the 
spiral r = Ae ad , where A and a are constants. Hint: Obtain an expression for 
r that contains no 0’s, and then use eq. (6.9). 

Section 6 . 4 .: Gravity, Kepler’s Laws 

4. r k potential *** 

A particle of mass m moves in a potential given by V(r) = (3r k . Let the 
angular momentum be L. 

(a) Find the radius, ro, of a circular orbit. 

(b) If the particle is given a tiny kick so that the radius oscillates around ro, 
find the frequency, u r , of these small oscillations in r. 

(c) What is the ratio of the frequency u r to the frequency of the (nearly) 
circular motion, ug . 61 Give a few values of k for which the ratio is 
rational, that is, for which the path of the nearly circular motion closes 
back on itself. 

5. Spring ellipse *** 

A particle moves in a V ( r ) = fir 2 potential. Following the general strategy in 
Sections 6.4.1 and 6.4.3, show that the particle’s path is an ellipse. 
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6. /?/r 2 potential *** 

A particle is subject to a V ( r ) = (3/r 2 potential. Following the general strategy 
in Section 6.4.1, find the shape of the particle’s path. You will need to consider 
various cases for j3. 

7. Rutherford scattering *** 

A particle of mass m travels in a hyperbolic orbit past a mass M, whose 
position is assumed to be fixed. The speed at infinity is vq, and the impact 
parameter is b (see Exercise 6). 

(a) Show that the angle through which the particle is deflected is 

4> = 7T — 2tan -1 (76) ==> b = - cot , where 7 = . (6.49) 

'y \ 2 ) GM 

(b) Let da be the cross-sectional area (measured when the particle is initially 

at infinity) that gets deflected into a solid angle of size dLl at angle r/). 10 
Show that ^ 1 

dLl 4y 2 sin 2 (^)/2) ^ ^ 

This quantity is called the differential cross section. The term Rutherford 
scattering actually refers to the scattering of charged particles, but since 
the electrostatic and gravitational forces are both inverse-square laws, 
the scattering formulas look the same, except for a few constants. 


10 The solid angle of a patch on a sphere is the area of the patch divided by the square of the 
sphere’s radius. So a whole sphere subtends a solid angle of 47r steradians (the name for one unit 
of solid angle). 
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Kff(r) 



Figure 6.11 


Kff( r ) 



Figure 6.12 


KtrCr) 



Figure 6.13 


6.7 Solutions 


1. Maximum L 


(a) The effective potential is 


V eff (r) = 


ii ' 

2 mr 2 


V 0 e~ x2r2 . 


(6.51) 


A circular orbit exists at the value(s) of r for which V^ s (r) = 0. Setting the 
derivative equal to zero and solving for L 2 gives, as you can show, 


L 2 = (2mV 0 X 2 )r 4 e~ x2r2 . (6.52) 


This implicitly determines r. As long as L isn’t too large, 14g(r) looks something 
like the graph in Fig. 6.11 (although it doesn’t necessarily dip down to negative 
values; see the remark below), so there are two solutions for r. The smaller 
solution is the one with the stable orbit. However, if L is too large, then there 
are no solutions to lV ff (r) = 0, because V e s(r) decreases monotonically to zero 
(because L 2 /2mr 2 does so). We’ll be quantitative about this in part (b). 

(b) The function r A e~ x r on the right-hand side of eq. (6.52) has a maximum 
value, because it goes to zero for both r —> 0, and r —> oo. Therefore, there 
is a maximum value of L for which a solution for r exists. The maximum of 
r 4 e~ x r occurs when 


{r A e~ x2r2 y = e~ x2r2 [4r 3 + r 4 (-2A 2 r)] =0 ^ 

r 2 = ^=rl (8.53) 

Plugging ro into eq. (6.52) gives 


2 8mVo 

max ~ A 2 ^ ' 

(6.54) 

Plugging ro and L 2 nax into (6.51) gives 


Veff(ro) = ^ (for L = L max 

). (6.55) 


Note that this is greater than zero. For the L = L rnax case, the graph of V e s is 
shown in Fig. 6.12. This is the cutoff case between having a dip in the graph, 
and decreasing monotonically to zero. 


Remark: A common error in this problem is to say that the condition for a circular 
orbit to exist is that V e s{r) < 0 at the point where 14ft (r) is minimum. The logic here 
is that the goal is to have a well in which the particle can be trapped, so it seems like 
we just need V e s to achieve a value less than the value at r = oo, namely 0. However, 
this gives the wrong answer (L^ = 2mVo/A 2 e, as you can show), because 14ft(r) can 
look like the graph in Fig. 6.13. This has a local minimum with 14e(r) > 0. £ 

2. Cross section 


(a) The effective potential is 

<6 ' 56) 

Setting the derivative equal to zero gives r = mC/L 2 . Plugging this into V e ff(f) 
gives 

eff 6m 3 C 2 ‘ 


(6.57) 
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VcB(r) 


(b) If the energy of the particle, E, is less than V^ ax , then the particle will reach 
a minimum value of r, and then head back out to infinity (see Fig. 6.14). 
If E is greater than VJJf a,x , then the particle will head in to r = 0, never to 
return. The condition for capture is therefore V^ ax < E. Using L = mvob and 
E = E a0 = rriv}j2, this condition becomes 


(mvob) 6 mvQ 

6m 3 C 2 < ~~2~~ 



The cross section for capture is therefore 



(6.59) 


It makes sense that this should increase with C and decrease with m and vq. 


3. Exponential spiral 

The given information r = Ae a6 yields (using 6 = L/mr 2 ) 

. a0 a ( L \ aL 
r = aAe v = or I —^ 1 = — . 


(6.60) 


Plugging this into eq. (6.9) gives 


m f aL\ L 2 Tr , . _ 

~2 yrnr) + 2 ^ + V ^ = K 


(6.61) 


V(r) = E • 


(1 + a 2 )L 2 


(6.62) 


The total energy, E, may be arbitrarily chosen to equal zero, if desired. 

4. r k potential 


(a) A circular orbit exists at the value of r for which the derivative of the effective 
potential (which is the negative of the effective force) is zero. This is simply the 
statement that the right-hand side of eq. (6.8) equals zero, so that r = 0. Since 
V'(r) = ftkr*- 1 , eq. (6.8) gives 


- pkr k 1 = 0 


ro 


/ L 2 y/(wi 

\m/3k ) 


(6.63) 


Note that if k is negative, then (3 must also be negative if there is to be a real 
solution for ro- 

(b) The long method of finding the frequency is to set r(f) = r'o + e(f), where e 
represents the small deviation from the circular orbit, and to then plug this 
expression for r into eq. (6.8). The result (after making some approximations) 
is a harmonic-oscillator equation of the form e = —u> 2 e. This general proce¬ 
dure, which is described in detail in Section 5.7, will work fine here (as you are 
encouraged to show), but let’s use an easier method. 



Figure 6.14 





VI-22 


CHAPTER 6. CENTRAL FORCES 


By introducing the effective potential, we have reduced the problem to a one¬ 
dimensional problem in the variable r. Therefore, we can make use of the result 
in Section 4.2, where we found in eq. (4.15) that to find the frequency of small 
oscillations, we simply need to calculate the second derivative of the potential. 
For the problem at hand, we must use the effective potential, because that is 
what determines the motion of the variable r. We therefore have 


(6.64) 


If you work through the r = ro + e method described above, you will find 
that you are basically calculating the second derivative of V e g, but in a rather 
cumbersome way. 

Using the form of the effective potential, we have 


y eff(r-o) = ^4 + 0k(k - l)r£“ 2 

= li(^ + (3k{k- l)r 0 fe+2 ). 

Using the ro from eq. (6.63), this simplifies to 

_ L 2 (k + 2) _ / V%(r 0 ) _ Ly/k + 2 


Kff(ro) = - 


(6.65) 


( 6 . 66 ) 


We could get rid of the ro here by using eq. (6.63), but this form of ui r will be 
more useful in part (c). 

Note that we must have k > — 2 for oj r to be real. If k < —2, then 17" (r 0 ) < 0, 
which means that we have a local maximum of V e g, instead of a local minimum. 
In other words, the circular orbit is unstable. Small perturbations grow, instead 
of oscillating around zero. 

(c) Since L = mr /6 for the circular orbit, we have 


uo 


I 



(6.67) 


Combining this with eq. (6.66), we find 


( 6 . 68 ) 


A few values of k that yield rational values for this ratio are (the plots of the 
orbits are shown below): 

• k = — 1 => ui r /toe = This is the gravitational potential. The variable r 
makes one oscillation for each complete revolution of the (nearly) circular 
orbit. 

• k = 2 => uJ r /uJo = 2: This is the spring potential. The variable r makes 
two oscillations for each complete revolution. 

• k = 7 => u> r /u>e = 3: The variable r makes three oscillations for each 
complete revolution. 

• k = — 7/4 => u> r /u)o = 1/2: The variable r makes half of an oscillation for 
each complete revolution. So we need to have two revolutions to get back 
to the same value of r. 




There is an infinite number of k values that yield closed orbits. But note that 
this statement applies only to orbits that are nearly circular. The “closed” 
nature of the orbits is only approximate, because it is based on eq. (6.64) which 
is an approximate result based on small oscillations. The only k values that 
lead to exactly closed orbits for any initial conditions are k = — 1 (gravity) and 
k = 2 (spring), and in both cases the orbits are ellipses. This result is known as 
Bertrand’s Theorem. 


5. Spring ellipse 

With V(r) = (dr 2 , eq. (6.16) becomes 


/ 1 dr \ 2 _ 2 mE 
\r 2 dOJ %* 


l_ _ 2 m/3r 2 

T 2 I? 


(6.69) 


As stated in Section 6.4.1, we could take a square root, separate variables, integrate 
to find 0(r), and then invert to find r(0). But let’s solve for r(6) in a slick way, as we 
did for the gravitational case, where we made the change of variables, y = 1/r. Since 
there are lots of r 2 terms floating around in eq. (6.69), it is reasonable to try the 
change of variables, y = r 2 or y = 1/r 2 . The latter turns out to be the better choice. 
So, using y = 1/r 2 and dy/d6 = —2 (dr/d0)/r 3 , and multiplying eq. (6.69) through 
by 1/r 2 , we obtain 


Defining 


= y- 


/'1 dy\ 2 _ 2 mEy 2 2 m(3 

\2d0j = V 

( mE\ 2 2 m/3 (mE\ 

mE/L 2 for convenience, we have 


S) 


'(f) 


m 2 \ 

mE 2 ) 


(6.70) 


(6.71) 


As in Section 6.4.1, we can just look at this equation and observe that 


^ = B cos 2(0 — 0o) 


(6.72) 


is the solution. We can rotate the axes so that 0o = 0, so we’ll drop the 0q from here 
on. Recalling our definition z = 1/r 2 — mE/L 2 and also the definition of B from eq. 
(6.71), eq. (6.72) becomes 


e cos 20 ), 


(6.73) 
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y 



Figure 6.16 


= A m 2 

e ~V mE 2 ‘ 


(6.74) 

It turns out, as we will see below, that e is not the eccentricity of the ellipse, as it was 
in the gravitational case. 

We will now use the procedure in Section 6.4.3 to show that eq. (6.74) represents an 
ellipse. For convenience, let 

, L 2 


mE ' 


(6.75) 


Multiplying eq. (6.73) through by kr 2 , and using 

cos 26> = cos 2 6 - sin 2 6 = % - % , (6.76) 

and also r 2 = x 2 + y 2 , we obtain k = (x 2 + y 2 ) + e(x 2 — y 2 ). This can be written as 

(6.77) 


where a = \l , and b = \ — • 

V 1 + e V 1-e 


This is the equation for an ellipse with its center located at the origin (as opposed to 
its focus located at the origin, as it was in the gravitational case). The s emi-maj or 
and semi-minor axes are b and a, respectively, and the focal length is c — s/b 2 — a 2 = 
i/2fce/(l — e 2 ) (see Fig. 6.16). The eccentricity is c/6 = i/2e/(l + e). 


Remark: If e = 0, then a = b, which means that the ellipse is actually a circle. Let’s see if 
this makes sense. Looking at eq. (6.74), we see that we want to show that circular motion 
implies 2/3L 2 = mE 2 . For circular motion, the radial F = ma equation is mv 2 /r = 2 fir => 
v 2 = 2 j3r 2 /m. The energy is therefore E = mv 2 / 2 + fir 2 = 2 /3r 2 . Also, the square of the 
angular momentum is L 2 = m 2 v 2 r 2 = 2m(3r i . Therefore, 2f3L 2 = 2/3(2m/3r 4 ) = m(2/3r 2 ) 2 = 
mE 2 , as we wanted to show. Jit 

6. /3/r 2 potential 

With V(r) = (3/r 2 , eq. (6.16) becomes 


/ 1 dr\ 2 _ 2 mE 1 2 m/3 

y^de) - i^r _ /U2 

_ 2 mE 1 / 2mf3\ 

W - v + ) ' 


Letting y = 1/r, this becomes 


/ dy \ 2 22 2 ml 

Vd6j +aV = ~L? 


where 


2 = 2 mp 

-1+ ^ 


(6.78) 


(6.79) 


We must now consider various possibilities for a 2 . These possibilities depend on how 
/3 compares to L 2 (which depends on the initial conditions of the motion). In what 
follows, note that the effective potential equals 




a 2 L 2 


(6.80) 



6.7. SOLUTIONS 


VI-25 


• a 2 > 0, or equivalently, 0 > —L 2 /2m: In this case, the effective potential looks 
like the graph in Fig. 6.17. The solution for y in eq. (6.79) is a trig function, 
which we will take to be a “sin” by appropriately rotating the axes. Using 
y = 1/r, we obtain 


(6.81) 


0 = 0 and 0 = n/a make the right-hand side equal to zero, so they correspond to 
r = oo. And 0 = tt/ 2a makes the right-hand side maximum, so it corresponds 
to the minimum value of r, which is r m i n = ci\/L' 2 /2mE. This minimum r can 
also be obtained in a much quicker manner by finding where V e s(r) = E. 

If the particle comes in from infinity (at 0 = 0), we see that it eventually heads 
back out to infinity (at 0 = 7r/a). The angle that the incoming path makes with 
the outgoing path is therefore 7r /a. So if a is large (that is, if 0 is large and 
positive, or if L is small), then the particle bounces nearly straight backwards. If 
a is small (that is, if 0 is negative, and if L 2 is only slightly larger than —2m/3), 
then the particle spirals around many times before popping back out to infinity. 
A few special cases are: (1) 0 = 0 ==> a = 1, which means that the total angle is -k, 
that is, there is no net deflection. In fact, the particle’s path is a straight line, because 
the potential is zero; see Exercise 11 . ( 2 ) L 2 = — 8m0/3 ==> a = 1 / 2 , which means 
that the total angle is 27 t, that is, the particle eventually comes back out along the 
same line that it went in. 

• a = 0, or equivalently, 0 = —L 2 /2m: In this case, the effective potential is 
identically zero, as shown in Fig. 6.18. Eq. (6.79) becomes 


fdy\ 2 _ 2 mE 
\dOj ~ L 2 ' 


(6.82) 


The solution to this is y = 0\j2rnE/L 2 + C, which gives 



Figure 6.17 


. E 

fV t{i (r) = 6 


Figure 6.18 


. 1 / L 2 

" 0 V 2 mE ’ 


(6.83) 


where we have set the integration constant, C, equal to zero by choosing 0 = 0 
to be the angle that corresponds to r = oo. Note that we can use 0 = —L 2 /2m 
to write r as r = \J—0/E/d. 

Since the effective potential is flat, the rate of change of r is constant. If the 
particle has r < 0, it will therefore reach the origin in finite time, even though 
eq. (6.83) say that it will spiral around the origin an infinite number of times 
(because 0 —> oo as r —> 0). 


• a 2 < 0, or equivalently, 0 < —L 2 /2m: In this case, the effective potential looks 
like the graph in either Fig. 6.19 or Fig. 6.20, depending on the sign of E. For 
convenience, let b be the positive real number such that b 2 = —a 2 . Then eq. 
(6.79) becomes 


( ^ 

\d6 


- b 2 y 2 = - 


(6.84) 


The solution to this equation is a hyperbolic trig function. But we must consider 
two cases: 




E 


1 V ef f(r) ~ -1/r 2 

Figure 6.20 
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(a) E > 0: Using the identity cosh 2 z — sinh 2 z = 1, and recalling y = 1/r, we 
see that the solution to eq. (6.84) is 11 


1 1 2mE 


(6.85) 


Unlike the a 2 > 0 case above, the sinh function has no maximum value. 
Therefore, the right-hand side can head to infinity, which means that r can 
head to zero. Note that for large z, we have sinh 2 w e z / 2 . So r heads to 
zero like e~ be , in other words, exponentially quickly. 

(b) E < 0: In this case, eq. (6.84) can be rewritten as 


2 2 _ MA 2 _ 2 ™\ E \ 
y \de) L 2 


( 6 . 86 ) 


The solution to this equation is 12 

1 _ 1 l2m\E\ 


(6.87) 


As in the sinh case, the cosh function has no maximum value. Therefore, the 
right-hand side can head to infinity, which means that r can head to zero. 
But in the present cosh case, the right-hand side does achieve a nonzero 
minimum value, when 9 = 0. So r achieves a maximum value (this is clear 
from Fig. 6.20) equal to r max = byjL 2 /2m\E\. This maximum r can also 
be obtained by simply finding where V e g(r) = E. After reaching r max , the 
particle heads back down to the origin. 


7. Rutherford scattering 


(a) From Exercise 6 , we know that the impact parameter, b, equals the distance b 
shown in Fig. 6.9. Therefore, Fig. 6.21 tells us that the angle of deflection (the 
angle between the initial and final velocity vectors) is 



( 6 . 88 ) 


But from eqs. (6.33) and (6.25), we have 


/ 2 (m^Q / 2 ) (mvob) 2 _ v%b 
~ GM ' 


(6.89) 


Substituting this into eq. ( 6 . 88 ), with 7 = Vq/(GM), gives the first expression 
in eq. (6.49). Dividing by 2 and taking the cotangent of both sides then gives 
the second expression, 

b= ^ cot (i)- ( 6 - 9 °) 

11 More generally, we should write sinh(0 — 9o) here. But we can eliminate the need for 60 by 
picking 9 = 0 to be the angle that corresponds to r = 00 . 

12 Again, we should write cosh(0 — 9o) here. But we can eliminate the need for 60 by picking 9 = 0 
to be the angle that corresponds to the maximum value of r. 
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Note that it actually isn’t necessary to go through all the work of Section 6.4.3 
to obtain this result, by determining a and b. We can simply use eq. (6.24), 
which says that r —> oo when cos 6 —> — 1/e. This then implies that the dotted 
lines in Fig. 6.21 have slope tan# = \J sec 2 6 — 1 = y/e 2 — 1, which reproduces 
eq. (6.89). 

(b) Imagine a wide beam of particles moving in the positive x-direction, toward 
the mass M. Consider a thin cross-sectional ring in this beam, with radius b 
and thickness db. Now consider a large sphere centered at M. Any particle 
that passed through the cross-sectional ring of radius b will hit this sphere in a 
ring located at an angle </> relative to the x-axis, with an angular spread of d<l>. 
The relation between db and d<f) is found from eq. (6.90). Using <i(cot 6 )jdft = 
-1/sin 2 /?, we have 

\db\ 1 

d4> I ' ‘ 

The area of the incident cross-sectional ring is da = 2nb \db\. What is the solid 
angle subtended by a ring at angle <j> with thickness dtjfl Taking the radius of the 
sphere to be R (which will cancel out), the radius of the ring is R sin </>, and the 
linear thickness is R\d<f>\. The area of the ring is therefore 27r(i?sin 0 )(R,\d<p\). 
and so the solid angle subtended by the ring is dfl = 2 tt sin <j> \d<p\ steradians. 
Therefore, the differential cross section is 

da _ 2 Trb\db\ 
dfl 2 tt sin cf) \d(/)\ 


/ b 

\sin <j)) \d<j>\ 

( (1/7) cot ((ft/2) \ / 1 \ 

V2sin(</>/2)cos(</>/2) J V.2- shrio 2) / 
1 

4 q 2 sin 4 (^>/2) 


Remarks: What does this “differential cross section” result tell us? It tells us that 
if we want to find out how much cross-sectional area gets mapped into the solid angle 
dfl at the angle <j>, then we can simply use eq. (6.92) to say (recalling 7 s Vq/(GM)), 


i 4 (0/2) 


(6.93) 


Let’s look at some special cases. If 0 « 180° (that is, backward scattering), then the 
amount of area that gets scattered into a nearly backward solid angle of dfl equals 
da = (G 2 M 2 /4 vq) dfl. If vo is small, then we see that da is large, that is, a large 
area gets deflected nearly straight backwards. This makes sense, because with vo ® 0, 
the orbits are essentially parabolic, which means that the initial and final velocities 
at infinity are (anti)parallel. (If you release a particle from rest far away from a 
gravitational source, it will come back to you. Assuming it doesn’t bump into the 
source, of course.) If vo is large, then we see that da is small, that is, only a small area 
gets deflected backwards. This makes sense, because the particles are more likely to 
fly past M without any deflection if they are moving fast, because the force has less 
time to act. 

Another special case is <j> « 0° (that is, there is negligible deflection). In this case, 
eq. (6.93) tells us that the amount of area that gets scattered into a nearly forward 
solid angle of dfl equals da « 00 . This makes sense, because if the impact parameter 
is large (and there is an infinite cross-sectional area for which this is true), then the 
particle will hardly feel the mass M, and will therefore continue to move essentially in 
a straight line. 



VI-28 


CHAPTER 6. CENTRAL FORCES 


What if we consider the electrostatic force, instead of the gravitational force? What 
is the differential cross section is that case? To answer this, note that we may rewrite 
7 as 


^=7 


(6.94) 


2(mvl/2) _ 2E 
' GM ~ GMm ~ a ' 

In the case of electrostatics, the force takes the form, F e = kqiq^/r 2 . This looks like the 
gravitational force, F g p Gmimi/r 2 , except that the constant a is now kqiq 2 , instead 
of GtoiTO 2. Therefore, the 7 in eq. (6.94) becomes 7 e = 2E/(kqiq2). Substituting this 
into eq. (6.92), we see that the differential cross section for electrostatic scattering is 


dfi 


_ hiAlfai. _ 


(6.95) 


This is the Rutherford scattering differential cross section formula. Around 1910, 
Rutherford and his students bombarded metal foils with alpha particles. Their re¬ 
sults for the distribution of scattering angles were consistent with the above formula. 
In particular, they observed back-scattering of the alpha particles. Since the above 
formula is based on the assumption of a point source for the potential, this led Ruther¬ 
ford to his theory that atoms contained a dense positively-charged nucleus, as opposed 
to being made of a spread-out “plum pudding” distribution of charge, which (as a 
special case of not yielding the correct distribution of scattering angles) doesn’t yield 
back-scattering, ft 
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Chapter 7 

Angular Momentum, Part I 
(Constant L) 

Copyright 2004 by David Morin, morin@physics.harvard.edu 


The angular momentum of a point mass, relative to a given origin, is 

L = r x p. (7.1) 

For a collection of particles, the total L is simply the sum of the L’s of each particle. 

The quantity r x p is a useful thing to study because it has many nice properties. 
One of these is the conservation law presented in Theorem 6.1, which allowed us to 
introduce the “effective potential” in Section 6.2. And later in this chapter we will 
introduce the concept of torque , r, which appears in the bread-and-butter statement, 
t = dL/dtt (analogous to Newton’s F = dp/dt law). 

There are two basic types of angular momentum problems in the world. Since 
the solution to any rotational problem invariably comes down to using t = dh/dt, 
we must determine how L changes in time. And since L is a vector, it can change 
because (1) its length changes, or (2) its direction changes (or through some com¬ 
bination of these effects). In other words, if we write L = LL, where L is the 
unit vector in the L direction, then L can change because L changes, or because L 
changes, or both. 

The first of these cases, that of constant L, is the easily understood one. Consider 
a spinning record. The vector L = x P is perpendicular to the record. If we 
give the record a tangential force in the proper direction, then it will speed up (in 
a precise way which we will soon determine). There is nothing mysterious going on 
here. If we push on the record, it goes faster. L points in the same direction as 
before, but it now simply has a larger magnitude. In fact, in this type of problem, 
we can completely forget that L is a vector. We can just deal with its magnitude 
L, and everything will be fine. This first case is the subject of the present chapter. 

The second case however, where L changes direction, can get rather confusing. 
This is the subject of the following chapter, where we will talk about gyroscopes, 
tops, and other such spinning objects that have a tendency to make one’s head spin 
also. In these situations, the entire point is that L is actually a vector. And unlike 
in the constant-L case, we really have to visualize things in three dimensions to see 
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what’s going on. 1 

The angular momentum of a point mass is given by the simple expression in 
eq. (7.1). But in order to deal with setups in the real world, which invariably 
consist of many particles, we must learn how to calculate the angular momentum 
of an extended object. This is the task of the Section 7.1. We will deal only with 
motion in the x-y plane in this chapter. Any rotations we talk about will therefore 
be around the z-axis (or an axis parallel to the z-axis). We’ll save the general 3-D 
case for Chapter 8. 


7.1 Pancake object in x-y plane 

Consider a flat, rigid body undergoing arbitrary motion (both translating and spin¬ 
ning) in the x-y plane; see Fig. 7.1. What is the angular momentum of this body, 
relative to the origin of the coordinate system? 2 

If we imagine the body to consist of particles of mass m;, then the angular 
momentum of the entire body is the sum of the angular momenta of each m*, which 
are Lj = r* x pj. So the total angular momentum is 

L = Er t xp, (7.2) 


For a continuous distribution of mass, we would have an integral instead of a sum. 

L depends on the locations and momenta of the masses. The momenta in turn 
depend on how fast the body is translating and spinning. Our goal here is to find 
the dependence of L on the distribution and motion of its constituent masses. The 
result will involve the geometry of the body in a specific way, as we will show. 

In this section, we will deal only with pancake-like objects that move in the x-y 
plane (or simple extensions of these). We will find L relative to the origin, and we 
will also derive an expression for the kinetic energy. We will deal with non-pancake 
objects in Section 7.2. 

Note that since both r and p for our pancake-like objects always lie in the x-y 
plane, the vector L = r x p always points in the z direction. This fact is what makes 
these pancake cases easy to deal with; L changes only because its length changes, 
not its direction. So when we eventually get to the r = dL/dt equation, it will take 
on a simple form. 

Let’s first look at a special case, and then we’ll look at general motion in the x-y 
plane. 


1 The difference between these two cases is essentially the same as the difference between the two 
basic F = dp/dt cases. The vector p can change simply because its magnitude changes, in which 
case we have F = ma. Or, p can change because its direction changes, in which case we have the 
centripetal-acceleration statement, F = mv 2 /r. (Or, there could be a combination of these effects.) 
The former case seems a bit more intuitive than the latter. 

2 Remember, L is defined relative to a chosen origin (because it has the vector r in it), so it 
makes no sense to ask what L is, without specifying what origin you’ve chosen. 
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7.1.1 Rotation about the 2 -axis 

The pancake in Fig. 7.2 rotates with angular speed u around the 2 -axis, in the 
counterclockwise direction (as viewed from above). Consider a little piece of the 
body, with mass dm and position ( x,y ). Let r = \Jx 2 + y 2 . This little piece travels 
in a circle around the origin with speed v = ur. Therefore, the angular momentum 
of this piece (relative to the origin) is equal to L = r x p = r(ydm) z = dmr 2 u> z. 
The z direction arises from the cross product of the (orthogonal) vectors r and p. 
The angular momentum of the entire body is therefore 



= J(x 2 + y 2 )uz. dm, (7.3) 

where the integration runs over the area of the body. If the density of the object is 
constant, as is usually the case, then we have dm = pdx dy. If we define the moment 
of inertia around the 2 -axis to be 

I z = Jr 2 dm = J (x 2 + y 2 ) dm, (7.4) 

then the 2 -component of L is 

L z = I z u, (7.5) 

and L x and L y are both equal to zero. In the case where the rigid body is made 
up of a collection of point masses to* in the x-y plane, the moment of inertia in eq. 
(7.4) simply takes the discretized form, 

(™) 

Given any rigid body in the x-y plane, we can calculate I z . And given uj, we 
can then multiply it by I z to find L z . In Section 7.3.1, we will get some practice 
calculating various moments of inertia. 

What is the kinetic energy of our object? We need to add up the energies of 
all the little pieces. A little piece has energy dmv 2 / 2 = dm(ru) 2 / 2. So the total 
kinetic energy is 

T = f r -^-dm. (7.7) 

With our definition of I z in eq. (7.4), we have 




Figure 7.2 


This is easy to remember, because it looks a lot like the kinetic energy of a point 
mass, mv 2 / 2. 
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Figure 7.4 


7.1.2 General motion in x-y plane 

How do we deal with general motion in the x-y plane? For the motion in Fig. 7.3, 
where the object is both translating and spinning, the various pieces of mass do not 
travel in circles around the origin, so we cannot write v = ur as we did above. 

It turns out to be highly advantageous to write the angular momentum, L, and 
the kinetic energy, T, in terms of the center-of-mass (CM) coordinates and the 
coordinates relative to the CM. The expressions for L and T take on very nice forms 
when written this way, as we now show. 

Let the coordinates of the CM be R = ( X , Y), and let the coordinates of a given 
point relative to the CM be r' = (x', y'). Then the given point has coordinates 
r = R + r' (see Fig. 7.4). Let the velocity of the CM be V, and let the velocity 
relative to the CM be V. Then v = V + v'. Let the body rotate with angular speed 
u/ around the CM (around an instantaneous axis parallel to the 2 -axis, so that the 
pancake remains in the x-y plane at all times). 3 Then v' = u'r'. 

Let’s look at L first. The angular momentum relative to the origin is 

L = J r xv dm 

= J (R + r') x (V + v') dm 

= MR x V + /r'xv' dm (cross terms vanish; see below) 

= MR x V + (^j r' 2 Jdm ^ z 

ee MR x V + (7 2 CM u/) z, (7.9) 


where M is the mass of the pancake. In going from the second to the third line 
above, the cross terms, Jr'xV dm and / Rxv' dm, vanish by definition of the CM, 
which says that / r' dm = 0 (see eq. (4.69)), and hence / v' dm = d(f r' dm)/dt = 0. 
The quantity / 2 M is the moment of inertia around an axis through the CM, parallel 
to the z-axis. Eq. (7.9) is a very nice result, and it is important enough to be called 
a theorem. In words, it says: 


Theorem 7.1 The angular momentum (relative to the origin) of a body can be 
found by treating the body as a point mass located at the CM and finding the angular 
momentum of this point mass relative to the origin, and by then adding on the 
angular momentum of the body relative to the CM. 4 

3 What we mean here is the following. Consider a coordinate system whose origin is the CM and 
whose axes are parallel to the fixed x- and y- axes. Then the pancake rotates with angular speed a/ 
in this reference frame. 

4 This theorem only works if we use the CM as the location of the imagined point mass. True, 
in the above analysis we could have chosen a point P other than the CM, and then written things 
in terms of the coordinates of P and the coordinates relative to P (which could also be described 
by a rotation). But then the cross terms in eq. (7.9) wouldn’t vanish, and we’d end up with an 
unenlightening mess. 
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Note that if we have the special case where the CM travels around the origin in 
a circle, with angular speed LI (so that V = LIR), then eq. (7.9) becomes L = 
( MR 2 Ll + z. 

Now let’s look at T. The kinetic energy is 

T = J dm 
= Jl\V + V\ 2 dm 

1 f 1 

= -MV 2 + I -v' 2 dm (cross term vanishes; see below) 

= X -MV 2 + J l -r' 2 J 2 d.m 

& X -MV 2 + * 7 z CM u/ 2 . (7.10) 

In going from the second to third line above, the cross term / V • v 7 dm vanishes by 
definition of the CM, as in the above calculation of L. Again, eq. (7.10) is a very 
nice result. In words, it says: 

Theorem 7.2 The kinetic energy of a body can be found by treating the body as a 
point mass located at the CM, and by then adding on the kinetic energy of the body 
due to the motion relative to the CM. 

To calculate E, my dear class, 

Just add up two things, and you’ll pass. 

Take the CM point’s E, 

And then add on with glee, 

The E ’round the center of mass. 

7.1.3 The parallel-axis theorem 

Consider the special case where the CM rotates around the origin at the same rate 
as the body rotates around the CM. This may be achieved, for example, by gluing a 
stick across the pancake and pivoting one end of the stick at the origin; see Fig. 7.5. 
In this special case, we have the simplified situation where all points in the pancake 
travel in circles around the origin. Let their angular speed be u. 

In this situation, the speed of the CM is V = ujR, so eq. (7.9) says that the 
angular momentum around the origin is 

L z = (MR 2 + I™)u. (7.11) 

In other words, the moment of inertia around the origin is 

\l z = MR 2 + I™\ . (7.12) 

This is the parallel-axis theorem. It says that once you’ve calculated the moment of 
inertia of an object around the axis passing through the CM (namely I^ M ), then if 



Figure 7.5 
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you want to calculate the moment of inertia around a parallel axis passing through 
an arbitrary point in the plane of the pancake, you simply have to add on MR 2 , 
where R is the distance from the point to the CM, and M is the mass of the pancake. 

Note that the parallel-axis theorem is simply a special case of the more general 
result in eq. (7.9), so it is valid only with the CM, and not with any other point. 

We can also look at the kinetic energy in this special case where the CM rotates 
around the origin at the same rate as the body rotates around the CM. Using 
V = ujR in eq. (7.10), we find 


T= ^(MR 2 + I™)uj 2 . 


(7.13) 


Example (A stick): Let’s verify the parallel-axis theorem for a stick of mass m 
and length £, in the case where we want to compare the moment of inertia around 
an axis through an end with the moment of inertia around an axis through the CM. 
Both of the axes are perpendicular to the stick, and parallel to each other, of course. 
For convenience, let p = m/l be the density. The moment of inertia around an axis 
through an end is 

pnd _ f x 2dm = f x 2 pdx = \pl 3 = \(pl)l 2 = \ml 2 . (7-14) 

Jo Jo 3 3 3 

The moment of inertia around an axis through the CM is 

r e/2 r e/2 

j CM _ / x 2 dm= / x 2 pdx =—pi 3 =—ml 2 . (7-15) 

J-l/2 J-i/2 12 12 

This is consistent with the parallel-axis theorem, eq. (7.12), because 

7 ' nd = ™(0 S * ,OM < 716 > 


Remember that this works only with the CM. If we instead want to compare 7 end 
with the I around a point, say, 1 /6 from that end, then we cannot say that they differ 
by mill 6) 2 . But we can compare each of them to I CM and say that they differ by 
(1/ 2) 2 - (1/ 3) 2 = 57 2 /36. 


pancake 

Q 


7.1.4 The perpendicular-axis theorem 

This theorem is valid only for pancake objects. Consider a pancake object in the 
x-y plane (see Fig. 7.6). Then the perpendicular-axis theorem says that 

h = Ix + Iy , (7.17) 


Figure 7.6 
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where I x and I y are defined analogously to the I z in eq. (7.4). That is, to find I x . 
imagine spinning the object around the a>axis at angular speed u ;, and then define 
I x = L x /uj. Likewise for I y . In other words, 

I x = J(y 2 + z 2 )dm , I y = J(z 2 + x 2 )dm, I z = J(x 2 + y 2 ) dm. (7.18) 

To prove this theorem, we simply use the fact that z = 0 for our pancake object. 
Eq. (7.18) then gives I z = I x + I y . 

In the limited number of situations where this theorem is applicable, it can save 
you some trouble. A few examples are given in Section 7.3.1 

7.2 Non-planar objects 

In Section 7.1, we restricted the discussion to pancake objects in the x-y plane. 
However, nearly all the results we derived carry over to non-planar objects, provided 
that the axis of rotation is parallel to the z-axis, and provided that we are concerned 
only with L z , and not L x or L y . So let’s drop the pancake assumption and run 
through the results we obtained above. 

First, consider an object rotating around the z-axis. Let the object have exten¬ 
sion in the z direction. If we imagine slicing the object into pancakes parallel to the 
x-y plane, then eqs. (7.4) and (7.5) correctly give L z for each pancake. And since 
the L z of the whole object is simply the sum of the L z s of all the pancakes, we see 
that the I z of the whole object is simply the sum of the I z s of all the pancakes. The 
difference in the z values of the pancakes is irrelevant. Therefore, for any object, we 
have 

l z — j(x 2 + y 2 ) dm, and L z = I z u>, (7.19) 

where the integration runs over the entire volume of the body. In Section 7.3.1 we 
will calculate I z for many non-planar objects. 

Even though eq. (7.19) gives the L z for an arbitrary object, the analysis in 
this chapter is still not completely general because (1) we are restricting the axis 
of rotation to be the (fixed) z-axis, and (2) even with this restriction, an object 
outside the x-y plane might have nonzero x and y components of L; we found only 
the z-component in eq. (7.19) This second fact is strange but true. Ponder it for 
now; we’ll deal with it in Section 8.2. 

As far as the kinetic energy goes, the T for a non-planar object rotating around 
the z-axis is still given by eq. (7.8), because we can obtain the total T by simply 
adding up the T’s of each of the pancake slices. 

Also, eqs. (7.9) and (7.10) continue to hold for a non-planar object in the case 
where the CM is translating while the object is spinning around it (or more precisely, 
spinning around an axis parallel to the z-axis and passing through the CM). The 
velocity V of the CM can actually point in any direction, and these two equations 
will still be valid. But we’ll assume in this chapter that all velocities are in the x-y 
plane. 
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Lastly, the parallel-axis theorem still holds for non-planar object. But as men¬ 
tioned in Section 7.1.4, the perpendicular-axis theorem does not. This is the one 
instance where we need the planar assumption. 

Finding the CM 

The center of mass has come up repeatedly in this Chapter. For example, when 
we used the parallel-axis theorem, we needed to know where the CM was. In some 
cases, such as with a stick or a disk, the location is obvious. But in other cases, 
it isn’t so clear. So let’s get a little practice calculating the location of the CM. 
Depending on whether the mass distribution is discrete or continuous, the position 
of the CM is defined by (see eq. (4.69)) 

R CM = E|p, or Rcm = 1^. (7.20) 

where M is the total mass. 

Let’s do an example with a continuous mass distribution. As with many problems 
involving an integral, the main step in the solution is deciding how you want to slice 
up the object to do the integral. 


Example (Hemispherical shell): Find the location of the CM of a hollow hemi¬ 
spherical shell, with uniform mass density and radius R. 

Solution: By symmetry, the CM is located on the line above the center of the base. 
So our task reduces to finding the height, ycu- Let the mass density be o. We’ll slice 
the hemisphere up into horizontal rings, described by the angle 9 above the horizontal, 
as shown in Fig. 7.7. If the angular thickness of a ring is dO, then its mass is 

dm = odA= o (length) (width) = cr (2n R cos 0)(Rd0). (7-21) 

All points on the ring have a y value of R sin 9. Therefore, 

y C M = jjJydm = j^ 2 (RAn0)(2nR 2 ocos9d6) 

r /2 

= R sin 0 cos 0d9 

Jo 

■Rsin 2 fl r /2 

2 lo 

= f • (7-22) 

The simple factor of 1/2 here is nice, but it’s not all that obvious. It comes from the 
fact that each value of y is represented equally. If you solved the problem by doing a 
dy integral instead of a d9 one, you would find that there is the same area (and hence 
the same mass) in each ring of height dy. You are encouraged to work this out. 
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The calculation of a CM is very similar to the calculation of a moment of inertia. 
Both involve an integration over the mass of an object, but the former has one power 
of a length in the integrand, whereas the latter has two powers. 


7.3 Calculating moments of inertia 

7.3.1 Lots of examples 

Let’s now calculate the moments of inertia of various objects, around specified axes. 
We will use p to denote mass density (per unit length, area, or volume, as appro¬ 
priate). We will assume that this density is uniform throughout the object. For 
the more complicated of the objects below, it is generally a good idea to slice the 
object up into pieces for which / is already known. The problem then reduces to 
integrating over these known J’s. There is usually more than one way to do this 
slicing. For example, a sphere may be looked at as a series of concentric shells or 
a collection of disks stacked on top of each other. In the examples below, you may 
want to play around with slicings other than the ones given. Consider at least a few 
of these examples to be problems and try to work them out for yourself. 


1. A ring of mass M and radius R (axis through center, perpendicular to plane; Fig. 7.8): 

I = J r 2 dm = R 2 pR dO = (2tt Rp)R 2 , (7.23) 

as it should be, because all of the mass is a distance R from the axis. 

2. A ring of mass M and radius R (axis through center, in plane; Fig. 7.8): 

The distance from the axis is (the absolute value of) I? sin#. Therefore, 

I = Jr 2 dm = (RsinO) 2 pRdO = |(2tt Rp)R 2 = | \MR 2 j , (7.24) 

where we have used sin 2 # = (1 — cos 29)/2. You can also find I by using the 
perpendicular-axis theorem. In the notation of section 7.1.4, we have I x = I y , by 
symmetry. Therefore, I z = 2 I x . Using I z = MR 2 from Example 1 then gives 
4 = MR 2 /2. 

3. A disk of mass M and radius R (axis through center, perpendicular to plane; Fig. 7.9): 

I = J r 2 dm= J J r 2 prdr dO = (R 4 ‘/A)2-rrp = ^(pttR 2 )R 2 = (7.25) 

You can save one (trivial) integration step by considering the disk to be made up of 
many concentric rings, and invoking Example 1. The mass of each ring is p2nrdr. 
Integrating over the rings gives J = f^(p2nr dr)r 2 = it,R 4, p/2 = MR 2 / 2, as above. 
Slicing up the disk is fairly inconsequential in this example, but it will save you some 
trouble in others. 
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Figure 7.8 
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Figure 7.10 
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4. A disk of mass M and radius R (axis through center, in plane; Fig. 7.9): 

Slice the disk up into rings, and use Example 2. 

f R 1 i-1 

I = / (l/2)(p27rrdr)r 2 = (R 2 /A)pn = -(fmR?)R 2 = \MR 2 . (7.26) 

Jo 4 L- 1 

Or, just use Example 3 and the perpendicular-axis theorem. 

5. A thin uniform rod of mass M and length L (axis through center, perpendicular to 
rod; Fig. 7.10): 

f r^/ 2 i i-1 

I = x 2 dm = / x 2 pdx = — ( pL)L 2 = j~ML 2 . (7.27) 

J J-L/2 12 1 - 1 


6. 


A thin uniform rod of mass M and length L (axis through end, perpendicular to rod; 
Fig. 7.10): 


= J x 2 dm = J 


x 2 pdx = -(pL)L 2 = 


(7.28) 


7. A spherical shell of mass M and radius R (any axis through center; Fig. 7.11): 

Let’s slice the sphere into horizontal ring-like strips. In spherical coordinates, the 
radius of a ring is given by r = RsinO, where 9 is the angle down from the north 
pole. The area of a strip is then 2ir(Rsm 9)RdO. Using f sin 3 9 = f sin0(1 —cos 2 9) = 
— cos 9 + cos 3 9/3, we have 


= / r 2 dm = / (Rsin9) 2 2irp(Rsin 9)Rd9 = 2-irpR 4 / sin 3 0 

J Jo Jo 

= 2npR i (A/3) = \(A-kR 2 P )R 2 = 1 §Mi? 2 |. 


(7.29) 


8. A solid sphere of mass M and radius R (any axis through center; Fig. 7.11): 

A sphere is made up of concentric spherical shells. The volume of a shell is A-Kr 2 dr. 
Using Example 7, we have 

r R 2 i-1 

I = / (2/3)(4t T P r 2 dr)r 2 = (R 5 / 5)(8ttp/3) = -(4/3tt R?p)R 2 = \MR 2 . (7.30) 
Jo 5 I-1 


9. An infinitesimally thin triangle of mass M and length L (axis through tip, perpen¬ 
dicular to plane; Fig. 7.12): 

Let the base have length a, where a is infinitesimally small. Then a slice at a distance 
x from the tip has length a(x/L). If the slice has thickness dx, then it is essentially 
a point mass of mass dm = paxdx/L. Therefore, 

1 = J x 2 dm = jf x 2 pax/L dx=^ {paL/2)L 2 = \ML 2 , (7.31) 

because aL/2 is the area of the triangle. This of course has the same form as the disk 
in Example 3, because a disk is made up of many of these triangles. 

10. An isosceles triangle of mass M, vertex angle 2/3, and common-side length L (axis 
through tip, perpendicular to plane; Fig. 7.12): 
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Let h be the altitude of the triangle (so h = L cos 0). Slice the triangle into thin strips 
parallel to the base. Let x be the distance from the vertex to a thin strip. Then the 
length of a strip is I = 2x tan 0, and its mass is dm = p(2x tan 0 dx). Using Example 
5 above, along with the parallel-axis theorem, we have 

I = £dm (g + A) = jf'Wanff.fc) + A) 

= £ 2„w(l + ^)y<fc-2pW ( 1 +^) 7 . (7-32) 

But the area of the whole triangle is h 2 tan 0, so we have I = (Mh 2 / 2)(1 + tan 2 0/3). 
In terms of L, this is 


I = (ML 2 /2)(cos 2 0 + sin 2 0/3) = I ^ML 2 (l — | sin 2 0) I. (7.33) 


11. A regular V-gon of mass M and “radius” R (axis through center, perpendicular to 
plane; Fig. 7.13): 

The A r -gon is made up of N isosceles triangles, so we can use Example 10, with 
0 = 7 t/N. The masses of the triangles simply add, so if M is the mass of the whole 
V-gon, we have 

I = \MR 2 ( 1- |sin 2 f) . (7.34) 

Let’s list the values of I for a few N. We’ll use the shorthand notation (IV, I /MR 2 ). 
Eq. 7.34 gives (3, |), (4, |], (6, £), (oo, |). These values of I form a nice arithmetic 
progression. 

12. A rectangle of mass M and sides of length a and b (axis through center, perpendicular 
to plane; Fig. 7.13): 

Let the 2 -axis be perpendicular to the plane. We know that I x = Mb 2 / 12 and 
I y = Ma 2 / 12, so the perpendicular-axis theorem tells us that 

I z = I x + I y = T M (a 2 + b 2 )] . (7.35) 



• | b 

a 

Figure 7.13 


7.3.2 A neat trick 

For some objects with certain symmetries, it is possible to calculate I without doing 
any integrals. All that is needed is a scaling argument and the parallel-axis theorem. 
We will illustrate this technique by finding I for a stick (Example 5 above). Other 
applications can be found in the problems for this chapter. 

In the present example, the basic trick is to compare / for a stick of length L 
with I for a stick of length 2 L. A simple scaling argument shows the latter is eight 
times the former. This is true because the integral / x 2 dm = f x 2 pdx has three 
powers of x in it. So a change of variables, y = 2x, brings in a factor of 2 3 = 8. 
Equivalently, if we imagine expanding the smaller stick into the larger one, then a 
corresponding piece will be twice as far from the axis, and also twice as massive. 
The integral / x 2 dm therefore increases by a factor of 2 2 • 2 = 8. 
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The technique is most easily illustrated with pictures. If we denote the moment 
of inertia of an object by a picture of the object, with a dot signifying the axis, then 
we have: 


L L 


•-- + «(4) 2 

The first line comes from the scaling argument, the second line comes from the 
fact that moments of inertia simply add (the left-hand side is two copies of the right- 
hand side, attached at the pivot), and the third line comes from the parallel-axis 
theorem. Equating the right-hand sides of the first two equations gives 

Plugging this expression for •- into the third equation gives the desired 

result, 



Note that sooner or later we must use real live numbers, which enter here through 
the parallel-axis theorem. Using only scaling arguments isn’t sufficient, because they 
provide only linear equations homogeneous in the P s, and therefore give no means 
of picking up the proper dimensions. 

Once you’ve mastered this trick and applied it to the fractal objects in Problem 
6, you can impress your friends by saying that you can “use scaling arguments, 
along with the parallel-axis theorem, to calculate moments of inertia of objects with 
fractal dimension.” And you never know when that might come in handy! 

7.4 Torque 

We will now show that (under certain conditions, stated below) the rate of change 
of angular momentum is equal to a certain quantity, r, which we call the torque. 
That is, r = dL/dt. This is the rotational analog of our old friend F = dp/dt 
involving linear momentum. The basic idea here is straightforward, but there are 
two subtle issues. One deals with internal forces within a collection of particles. 
The other deals with origins (the points relative to which the angular momentum 
is calculated) that are not fixed. To keep things straight, we’ll prove the general 
theorem by dealing with three increasingly complicated situations. 

Our derivation of r = dL/dt here holds for completely general motion; we can 
take the result and use it in the following chapter, too. If you wish, you can construct 
a more specific proof of r = dL/dt for the special case where the axis of rotation is 
parallel to the z-axis. But since the general proof is no more difficult, we’ll present 
it here in this chapter and get it over with. 
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7.4.1 Point mass, fixed origin 

Consider a point mass at position r relative to a fixed origin (see Fig. 7.14). The 
time derivative of the angular momentum, L = r x p, is 


dL 

dt 


* (rxp) 

dr dp 

Tt xp + rx ^ 


dt 


< (mv) + 
- r x F, 


(7.36) 


where F is the force acting on the particle. This is the same proof as in Theorem 
6.1, except that here we are considering an arbitrary force instead of a central one. 
If we define the torque on the particle as 


then eq. (7.36) becomes 


t = r 


F, 


dL 

dt 


(7.37) 

(7.38) 


7.4.2 Extended mass, fixed origin 

In an extended object, there are internal forces acting on the various pieces of the 
object, in addition to whatever external forces exist. For example, the external force 
on a given atom in a body might come from gravity, while the internal forces come 
from the adjacent atoms. How do we deal with these different types of forces? 

In what follows, we will deal only with internal forces that are central forces, so 
that the force between two objects is directed along the line between them. This is 
a valid assumption for the pushing and pulling forces between molecules in a solid. 
(It isn’t valid, for example, when dealing with magnetic forces. But we won’t be 
interested in such things here.) We will invoke Newton’s third law, which says that 
the force that particle 1 applies to particle 2 is equal and opposite to the force that 
particle 2 applies to particle 1. 

For concreteness, let us assume that we have a collection of N discrete particles 
labeled by the index i (see Fig. 7.15). In the continuous case, we would need 
to replace the following sums with integrals. The total angular momentum of the 
system is 

N 

L = E r . x p*- ( 7 - 39 ) 

The force acting on each particle is F| xt + Fj nt = dpi/dt. Therefore, 


dL 

dt 


r i x pi 


E drj 

~dt 


p< + E r 


x 


dpi 

dt 



Figure 7.14 


y 

I N= 4 



Figure 7.15 
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Figure 7.16 
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= ^Vj x (mvj) + Er* x (F| xt + F- nt ) 

= O + ^riXFf* 

= E r i Xt - (7-40) 

The second-to-last line follows because v* x v* = 0, and also because J2i r i x Fj nt = 0, 
as you can show in Problem 8. In other words, the internal forces provide no net 
torque. This is quite reasonable. It basically says that a rigid object with no external 
forces won’t spontaneously start rotating. 

Note that the right-hand side involves the total external torque acting on the 
body, which may come from forces acting at many different points. Note also that 
nowhere did we assume that the particles were rigidly connected to each other. Eq. 
(7.40) still holds even if there is relative motion among the particles. 

7.4.3 Extended mass, non-fixed origin 

Let the position of the origin be ro (see Fig. 7.16), and let the positions of the 
particles be r*. The vectors ro and r, are measured with respect to a given fixed 
coordinate system. The total angular momentum of the system, relative to the 
(possibly moving) origin ro, is 

L = E( r * - r o) x rni(ii ~ *o)- (7.41) 

Therefore, 

f = s(l>- r °)* oxft-fo)) 

= E(^ _ *o) X rrii{ii ~ ro) + E( r * - r o) x m *(ri - ro) 

= 0 + E(r* - ro) x (F| xt + Fj nt - m;fo), (7.42) 

because mir* is the net force (namely F® xt + Fj nt ) acting on the ith particle. But 
a quick corollary of Problem 8 is that the term involving Fj nt vanishes (show this). 
And since X) m i r i = MR (where M = )V m, is the total mass, and R is the position 
of the center of mass), we have 

^ = E(r; - ro) X F| xt - M(R - r 0 ) X r 0 . (7.43) 

The first term is the external torque, relative to the origin ro- The second term is 
something we wish would go away. And indeed, it usually does. It vanishes if any 
of the following three conditions is satisfied. 


1. R = tq. That is, the origin is the CM. 
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2. ro = 0. That is, the origin is not accelerating. 


3. (R — ro) is parallel to ro- This condition is rarely invoked. 
If any of these conditions is satisfied, then we are free to write 




(7.44) 


In other words, we can equate the total torque with the rate of change of the total 
angular momentum. An immediate corollary of this result is: 


Corollary 7.3 If the total torque on a system is zero, then its angular momentum 
is conserved. In particular, the angular momentum of an isolated system (one that 
is subject to no external forces) is conserved. 

Everything up to this point is valid for arbitrary motion. The particles can be 
moving relative to each other, and the various Lj’s can point in different directions, 
etc. But let’s now restrict the motion. In the present chapter, we are dealing 
only with cases where L is constant (taken to point in the z-direction). Therefore, 
dL/dt = d(LL)/dt = (dL/dt) L. If in addition we consider only rigid objects (where 
the relative distances among the particles is fixed) that undergo pure rotation around 
a given point, then L = Iu, which gives dL/dt = Icu = la. Taking the magnitude 
of both sides of eq. (7.44) then gives 


(7.45) 


Invariably, we will calculate angular momentum and torque around either a fixed 
point or the CM. These are “safe” origins, in the sense that eq. (7.44) holds. As 
long as you vow to always use one of these safe origins, you can simply apply eq. 
(7.44) and not worry much about its derivation. 

Remarks on the third condition: You’ll probably never end up invoking the third 
condition above, but it’s interesting to note that there is a simple way of understanding it 
in terms of accelerating reference frames. This is the topic of Chapter 9, so we’re getting 
a little ahead of ourselves here, but the reasoning is as follows. Let r 0 be the origin of a 
reference frame that is accelerating with acceleration ro- Then all objects in this accelerated 
frame feel a mysterious fictitious force of — mfo- For example, on a train accelerating to the 
right with acceleration a, you feel a strange force to the left of ma. If you don’t counter this 
with another force, you will fall over. The fictitious force acts just like a gravitational force. 
Therefore, it effectively acts at the CM, producing a torque of (R — ro) x (—Mfo). This is 
the second term in eq. (7.43). This term will vanish if the CM is directly “above” (as far 
as the fictitious gravitational force is concerned) the origin, in other words, if (R — ro) is 
parallel to ro- 

There is one common situation where the third condition can be invoked. Consider a 
wheel rolling without slipping on the ground. Mark a point on the rim. At the instant this 
point in in contact with the ground, it is a valid choice for the origin. This is true because 
(R — r 0 ) points vertically. And r 0 also points vertically. A point on a rolling wheel traces 
out a cycloid. Right before the point hits the ground, it is moving straight downward; right 
after it hits the ground, it is moving straight upward. But never mind, it’s still a good idea 
to pick your origin to be the CM or a fixed point, even if the third condition holds, ft 
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Figure 7.17 



Figure 7.18 


For conditions that number but three, 
We say, “Torque is dL by dt.” 

But though they’re all true, 

I’ll stick to just two; 

It’s CM’s and fixed points for me. 


Example: A string wraps around a uniform cylinder of mass M, which rests on a 
fixed plane. The string passes up over a massless pulley and is connected to a mass 
m, as shown in Fig. 7.17. Assume that the cylinder rolls without slipping on the 
plane, and that the string is parallel to the plane. What is the acceleration of the 
mass to? What is the minimum value of M /to for which the cylinder accelerates down 
the plane? 


Solution: The friction, tension, and gravitational forces are shown in Fig. 7.18. 
Define positive a\, a 2 , and a as shown. These three accelerations, along with T and 
F, are five unknowns. We therefore need to produce five equations. They are: 

(1) F = ma on to => T — mg = ma 2 . 

(2) F = ma on M => Mgsvn.6 — T — F = Ma\. 

(3) r = la on M (around the CM) =► FR — TR = ( MR 2 /2)a. 

(4) Non-slipping condition => a = a\/R. 

(5) Conservation of string => a 2 = 2oi. 


A few comments on these equations: The normal force and the gravitational force 
perpendicular to the plane cancel, so we can ignore them. We have picked positive F 
to point up the plane, but if it happens to point down the plane and thereby turn out 
to be negative, that’s fine (but it won’t); we don’t need to worry about which way it 
really points. In (3), we are using the CM of the cylinder as our origin, but we can 
also use a fixed point; see the remark below. In (5), we have used the fact that the 
top of a rolling wheel moves twice as fast as the center. This is true because it has 
the same speed relative to the center as the center had relative to the ground. 

We can go about solving these five equations in various ways. Three of the equations 
involve only two variables, so it’s not so bad. (3) and (4) give F — T = Mai/2. 
Adding this to (2) gives Mg sin# — 2T = 3 Mai /2. Using (1) to eliminate T, and 
using (5) to write a-\ in terms of a 2 , then gives 


Mg sin 9 — 2 (mg + ma 2 ) 


3Ma 2 


_ (M sin 9 — 2 m)g 
a2 | M + 2 to 


And ai = a 2 /2. We see that aq is positive (that is, the cylinder rolls down the plane) 
if M/to > 2/sin#. 


Remark: In using r = dL/dt, we can also pick a fixed point as our origin, instead of the 
CM. The most sensible point is one located somewhere along the plane. The Mg sin# force 
now provides a torque, but the friction does not. The angular momentum of the cylinder 
with respect to a point on the plane is Iuj + MvR, where the second term comes from the 
L due to the object being treated like a point mass at the CM. So r = dL/dt becomes 
(Mgsin9)R — T(2R) = la + MaiR. This is simply the sum of the third equation plus R 
times the second equation above. We therefore obtain the same result. £ 
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7.5 Collisions 

In Section 4.7, we looked at collisions involving point particles (or otherwise non¬ 
rotating objects). The fundamental ingredients we used to solve a collision problem 
were conservation of momentum and conservation of energy (if the collision was 
elastic). With conservation of angular momentum now at our disposal, we can 
extend our study of collisions to ones that contain rotating objects. The additional 
fact of conservation of L will be compensated for by the new degree of freedom for 
the rotation. Therefore, provided that the problem is set up properly, we will still 
have the same number of equations as unknowns. 

In an isolated system, conservation of energy can be used only if the collision is 
elastic (by definition). But conservation of angular momentum is similar to conser¬ 
vation of momentum, in that it can always be used. However, conservation of L is 
a little different from conservation of p, because we have to pick an origin before we 
can proceed. In view of the three conditions that are necessary for Corollary 7.3 to 
hold, we must pick our origin to be either a fixed point or the CM of the system 
(we’ll ignore the third condition, since it’s rarely used). If we choose some other 
point, then r = dL/dt does not hold, so we have no right to claim that dL/dt equals 
zero just because the torque is zero (as it is for an isolated system). 

There is, of course, some freedom in choosing an origin from among the legal 
possibilities of fixed points or the CM. And since it is generally the case that one 
choice is better than the others (in that it makes the calculations easier), you should 
take advantage of this freedom. 

Let’s to two examples. First, and elastic collision, and then an inelastic one. 


Example (Elastic collision): A mass m travels perpendicularly to a stick of mass 
m and length £, which is initially at rest. At what location should the mass collide 
elastically with the stick, so that the mass and the center of the stick move with equal 
speeds after the collision? 

Solution: Let the initial speed of the mass be vo■ We have three unknowns in the 
problem (see Fig. 7.19), namely the desired distance from the middle of the stick, 
h\ the final (equal) speeds of the stick and the mass, v: and the final angular speed 
of the stick, u>. We can solve for these three unknowns by using our three available 
conservation laws: 

• Conservation of p: 


Figure 7 


mv o = mv + mv 


v 


vo 
2 ‘ 


• Conservation of E: 

mv § m / v o \ 2 I" m / no \ 2 1/ ml 2 \ 2 1 

~ 2 ~ = Tvyj + [~2VyJ + 2VT2/ W J 
_ 


(7.47) 


(7.48) 
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• Conservation of L: Let’s pick our origin to be the fixed point in space that 
coincides with the initial location of the center of the stick. Then conservation 
of L gives 

m „„/. = m (|) fe+ [(^) W + o]. (7.49) 

The zero here comes from the fact that the CM of the stick moves directly away 
from the origin, so there is no contribution to L from the first of the two parts 
in Theorem 7.1. Plugging the u> from eq. 7.48 into eq. 7.49 gives 



Let’s now do an inelastic problem, where one object sticks to another. We won’t 
be able to use conservation of E now. But conservation of p and L will be sufficient, 
because there is one fewer degree of freedom in the final motion, due to the fact that 
the objects do not move independently. 



Figure 7.20 


Example (Inelastic collision): A mass m travels at speed Vu perpendicularly to a 
stick of mass m and length l, which is initially at rest. The mass collides completely 
inelastically with the stick at one of its ends, and sticks to it. What is the resulting 
angular velocity of the system? 

Solution: The first thing to note is that the CM of the system is f/4 from the end, 
as shown in Fig. 7.20. The system will rotate about the CM as the CM moves in a 
straight line. Conservation of momentum quickly tells us that the speed of the CM 
is uo/2. Also, using the parallel-axis theorem, the moment of inertia of the system 
about the CM is 



There are now many ways to proceed, depending on what point we choose as our 
origin. 


First method: Choose the origin to be the fixed point that coincides with the 
location of the CM right when the collision happens (that is, the point 1 /4 from the 
end of the stick). Conservation of L says that the initial L of the ball must equal the 
final L of the system. This gives 


™vofy = (^me 2 )u + 0. 


6v 0 


U ' 


(7.52) 


The zero here comes from the fact that the CM of the stick moves directly away from 
the origin, so there is no contribution to L from the first of the two parts in Theorem 
7.1. Note that we didn’t need to use conservation of p in this method. 


Second method: Choose the origin to be the fixed point that coincides with the 
initial center of the stick. Then conservation of L gives 
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The right-hand side is the angular momentum of the system relative to the CM, plus 
the angular momentum (relative to the origin) of a point mass of mass 2 m located at 
the CM. 

Third method: Choose the origin to be the CM of the system. This point moves 
to the right with speed vo/2, along the line a distance i /4 below the top of the stick. 
Relative to the CM, the mass m moves to the right, and the stick moves to the left, 
both with speed vo/2. Conservation of L gives 

The zero here comes from the fact that the stick initially has no L around its center. 
A fourth reasonable choice for the origin is the fixed point that coincides with the 
initial location of the top of the stick. You can work this one out for practice. 


7.6 Angular impulse 

In Section 4.5.1, we defined the impulse, X, to be the time integral of the force 
applied to an object, which is the net change in linear momentum. That is, 

X = J* F(t) dt = Ap. (7.55) 

We now define the angular impulse, Xq, to be the time integral of the torque 
applied to an object, which is the net change in angular momentum. That is, 
fh 

X e = y r(t) dt = AL. (7.56) 

These are just definitions, devoid of any content. The place where the physics 
comes in is the following. Consider a situation where F(t) is always applied at the 
same position relative to the origin around which r(t) is calculated. Let this position 
be R. Then we have r(t) = R x F(f). Plugging this into eq. (7.56), and taking the 
constant R outside the integral, gives Xq = R x I. That is, 

AL = R x (Ap) (for F(t) applied at one position). (7-57) 

This is a very useful result. It deals with the net changes in L and p, and not 
with their changes at any particular instant. Even if F is changing in some arbitrary 
manner as time goes by, so that we have no idea what Ap and AL are, we still know 
that they are related by eq. (7.57). Also, note that the derivation of eq. (7.57) was 
completely general, so we can apply it in the next chapter, too. 

In many cases, we don’t have to worry about the cross product in eq. (7.57), 
because the lever arm R is perpendicular to the change in momentum Ap. In such 
cases, we have 

\AL\ = R\Ap\. (7.58) 

Also, in many cases the object starts at rest, so we don’t have to bother with the A’s. 
The following example is a classic application of angular impulse and eq. (7.58). 
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Example (Striking a stick): A stick of mass m and length £, initially at rest, is 
struck with a hammer. The blow is made perpendicular to the stick, at one end. Let 
the blow occur quickly, so that the stick doesn’t move much while the hammer is in 
contact. If the CM of the stick ends up moving at speed v, what are the velocities of 
the ends, right after the blow? 

Solution: We have no idea exactly what F(t) looks like, or for how long it is applied, 
but we do know from eq. (7.58) that A L = (£/2)A p, where L is calculated relative to 
the CM (so the lever arm is £/2). Therefore, (m£ 2 /12)u> = ( l/2)mv . Hence, the final 
v and u> are related by uj = 6v/£. 

The velocities of the ends are obtained by adding (or subtracting) the rotational 
motion to the CM’s translational motion. The rotational velocities of the ends are 
±w(i/ 2) = ±(&v/()(i/2) = ±3v. Therefore, the end that was hit moves with velocity 
v + 3v = To, and the other end moves with velocity v — 3v = —2v (that is, backwards). 


What L was, he just couldn’t tell. 

And pi He was clueless as well. 

But despite his distress, 

He wrote down the right guess 
For their quotient: the lever-arm’s i. 

Impulse is also useful for “collisions” that occur over extended times (see, for 
example, Problem 18). 
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7.7 Exercises 


Section 7.2: Non-planar objects 

1. Semicircle CM * 

A wire is bent into a semicircle of radius R. Find the location of the center of 
mass. 

2. Triangle CM * 

Find the CM of an isosceles triangle. 

3. Hemisphere CM * 

Find the CM of a solid hemisphere. 


Section 7.3: Calculating moments of inertia 

4. A cone * 

Find the moment of inertia of a solid cone (mass M, base radius R) around 
its symmetry axis. 

5. Triangle, the slick way * 

In the spirit of Section 7.3.2, find the moment of inertia of a uniform equilateral 
triangle of mass m and side L around a line joining a vertex to the opposite 
side (see Fig. 7.21). 

6. Fractal triangle ** 

Take an equilateral triangle of side £, and remove the “middle” triangle (1/4 
of the area). Then remove the “middle” triangle from each of the remaining 
three triangles, and so on, forever. Let the final fractal object have mass m. 
In the spirit of Section 7.3.2, find the moment of inertia around a line joining 
a vertex to the opposite side (see Fig. 7.22). Be careful how the mass scales. 

Section 7.j: Torque 

7. Swinging your arms * 

You are standing on the edge of a step on some stairs, facing up the stairs. 
You feel yourself starting to fall backwards, so you start swinging your arms 
around in vertical circles, like a windmill. This is what people tend to do in 
such a situation, but does it actually help you not to fall, or does it simply 
make you look silly? Explain your reasoning. 



Figure 7.21 



Figure 7.22 


8. Wrapping around the pole * 

A hockey puck, sliding on frictionless ice, is attached by a piece of string 
(lying along the surface) to a thin vertical pole. The puck is given a tangential 
velocity, and as the string wraps around the pole, the puck gradually spirals in. 
Is the following statement correct? “From conservation of angular momentum, 
the speed of the puck will increase as the distance to the pole decreases.” 



pivot 


VII-22 


CHAPTER 7. ANGULAR MOMENTUM, PART I (CONSTANT L ) 


* L 1 

Figure 7.23 



m 2m 

Figure 7.24 



Figure 7.25 


9. Falling quickly * 

A massless stick of length L is pivoted at one end and has a mass m attached 
to its other end. It is held in a horizontal position, as shown in Fig. 7.23. 
Where should a second mass m be attached to the stick, so that the stick falls 
as fast as possible when dropped? 

10. Massive pulley * 

Consider the Atwood’s machine shown in Fig. 7.24. The masses are m and 
2m, and the pulley is a uniform disk of mass m and radius r. The string is 
massless and does not slip with respect to the pulley. Find the acceleration of 
the masses. 

11. Atwood’s with a cylinder ** 

A massless string of negligible thickness is wrapped around a uniform cylinder 
of mass m and radius R. The string passes up over a massless pulley and is 
tied to a block of mass m at its other end, as shown in Fig. 7.25. What are 
the accelerations of the block and the cylinder? Assume that the string does 
not slip with respect to the cylinder. 

12. Maximum frequency * 

A pendulum is made of a uniform stick of length i. A pivot is placed somewhere 
along the stick, which is allowed to swing in a vertical plane. Where should 
the pivot be placed on the stick so that the frequency of (small) oscillations is 
maximum? 



13. Rolling down the plane * 

An circular object with moment of inertia (3mr 2 rolls without slipping down 
a plane inclined at angle 9. What is its linear acceleration? 

14. Coin on a plane * 

A coin rolls down a plane inclined at angle 9. If the coefficient of static friction 
between the coin and the plane is /r, what is the largest angle 9 for which the 
coin doesn’t slip? 

15. Bowling ball on paper * 

A bowling ball sits on a piece of paper on the floor. You grab the paper and pull 
it horizontally along the floor, with acceleration a. What is the acceleration 
of the center of the ball? Assume that the ball does not slip with respect to 
the paper. 

16. Spring and cylinder * 

The axle of a solid cylinder (mass M , radius R) is connected to a spring with 
spring-constant k, as shown in Fig. 7.26. If the cylinder rolls without slipping, 
what is the frequency of oscillations? 


Figure 7.26 
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17. Another spring and cylinder ** 

The top of a solid cylinder (mass M, radius R) is connected to a spring (at 
its equilibrium length) with spring-constant k, as shown in Fig. 7.27. If the 
cylinder rolls without slipping, what is the frequency of (small) oscillations? 

18. The spool ** 

A spool of mass m and moment of inertia I (around the center) is free to roll 
without slipping on a table. It has an inner radius r, and an outer radius R. 
If you pull on the string with tension T at an angle 9 (see Fig. 7.28), what is 
the acceleration of the spool? Which way does it move? 

19. Stopping the coin ** 

A coin stands vertically on a table. It is projected forward (in the plane of 
itself) with speed v and angular speed w, as shown in Fig. 7.29. The coefficient 
of kinetic friction between the coin and the table is ji. What should v and 
u be so that the coin comes to rest (both translationally and rotationally) a 
distance d from where it started? 

20. Accelerating plane * 

A ball with I = (2/5 )MR 2 is placed on a plane inclined at angle 9. The plane 
is accelerated upwards (along its direction) with acceleration a; see Fig. 7.30. 
For what value of a will the CM of the ball not move? Assume that there is 
sufficient friction so that the ball doesn’t slip with respect to the plane. 

21. Raising the hoop ** 

A bead of mass m is positioned at the top of a frictionless hoop of mass M 
and radius R, which stands vertically on the ground. A wall touches the hoop 
on its left, and a short wall of height R touches the hoop on its right, as shown 
in Fig. 7.31. All surfaces are frictionless. The bead is given a tiny kick, and it 
slides down the hoop, as shown. What is the smallest value of m/M for which 
the hoop will rise up off the ground at some time during the motion? (Note: 
It is possible to solve this problem using only force, but solve it here by using 
torque.) 



Figure 7.27 



Figure 7.28 
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Figure 7.29 



Figure 7.30 


22. Coin and plank ** 

A coin of mass M and radius R stands vertically on the right end of a horizontal 
plank of mass M and length L, as shown in Fig. 7.32. The plank is pulled 
to the right with a constant force F. Assume that the coin does not slip with 
respect to the plank. What are the accelerations of the plank and coin? How 
far to the right does the coin move by the time the left end of the plank reaches 
it? 



Figure 7.31 



Figure 7.32 





Figure 7.34 



Figure 7.35 
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23. Board and cylinders *** 

A board lies on top of two uniform cylinders which lie on a fixed plane inclined 
at angle 0, as shown in Fig. 7.33. The board has mass to, and each of the 
cylinders has mass m/2. If there is no slipping between any of the surfaces, 
what is the acceleration of the board? 

24. Moving plane **** 

A disk of mass m and moment of inertia / = (3mr 2 is held motionless on a 
plane of mass M and angle of inclination 6 (see Fig. 7.34). The plane rests 
on a frictionless horizontal surface. The disk is released. Assuming that it 
rolls without slipping on the plane, what is the horizontal acceleration of the 
plane? Hint: You probably want to do Problem 2.2 first. 

25. Tower of cylinders **** 

Consider the infinitely tall system of identical massive cylinders and massless 
planks shown in Fig. 7.35. The moment of inertia of the cylinders is / = 
MR 2 / 2. There are two cylinders at each level, and the number of levels is 
infinite. The cylinders do not slip with respect to the planks, but the bottom 
plank is free to slide on a table. If you pull on the bottom plank so that it 
accelerates horizontally with acceleration a, what is the horizontal acceleration 
of the bottom row of cylinders? 

Section 7.5: Collisions 

26. Pendulum collision * 

A stick of mass m and length £ is pivoted at an end. It is held horizontally and 
then released. It swings down, and when it is vertical, the free end elastically 
collides with a ball, as shown in Fig. 7.36. (Assume that the ball is initially 
held, and then released a split second before the stick strikes it.) If the stick 
loses half of its angular velocity during the collision, what is the mass of the 
ball? What is the speed of the ball right after the collision? 

27. Spinning stick ** 

A stick of mass to and length £ spins around on a frictionless table, with its CM 
stationary (but not fixed by a pivot). A mass M is placed on the plane, and 
the end of the stick collides elastically with it, as shown in Fig. 7.37. What 
should M be so that after the collision the stick has translational motion, but 
no rotational motion? 


Figure 7.37 
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28. Another spinning stick ** 

A stick of mass to and length i initially rotates with frequency uona fric¬ 
tionless table, with its CM at rest (but not fixed by a pivot). A ball of mass 
to is placed on the table, and the end of the stick collides elastically with it, 
as shown in Fig. 7.38. What is the resulting angular velocity of the stick? 

29. Same final speeds * 

A stick slides (without rotating) across a frictionless table and collides elasti¬ 
cally at one of its ends with a ball. Both the stick and the ball have mass to. 
The mass of the stick is distributed in such a way that the moment of inertia 
around the CM (which is at the center of the stick) equals / = Ami 2 , where A 
is some number. What should A be so that the ball moves at the same speed 
as the center of the stick after the collision? 

30. No final rotation * 

A stick of mass to and length l spins around on a frictionless table, with its 
CM stationary (but not fixed by a pivot). It collides elastically with a mass 
to, as shown in Fig. 7.39. At what location should the collision occur (specify 
this by giving the distance from the center of the stick) so that the stick has 
no rotational motion afterwards? 

Section 7.6: Angular Impulse 

31. Center of percussion * 

You hold one end of a uniform stick of length L. The stick is struck with 
a hammer. Where should this blow occur so that the end you are holding 
doesn’t move (immediately after the blow)? In other words, where should the 
blow occur so that you don’t feel a “sting” in your hand? This point is called 
the center of percussion. 

32. Another center of percussion * 

You hold the top vertex of a solid equilateral triangle of side length L. The 
plane of the triangle is vertical. It is struck with a hammer, somewhere along 
the vertical axis. Where should this blow occur so that the point you are 
holding doesn’t move (immediately after the blow)? The moment of inertia 
about any axis through the CM of an equilateral triangle is ML 2 / 24. 

33. Not hitting the pole * 

A (possibly non-uniform) stick of mass to and length l lies on frictionless ice. 
Its midpoint (which is also its CM) touches a thin pole sticking out of the ice. 
One end of the stick is struck with a quick blow perpendicular to the stick, so 
that the CM moves away from the pole. What is the minimum value of the 
stick’s moment of inertia that allows the stick not to hit the pole? 


/ 


(top view) 


| m 

Figure 7.38 




• m 


l (top view) 


Figure 7.39 
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34. Up, down, and twisting ** 

A uniform stick is held horizontally and then released. At the same instant, 
one end is struck with a quick upwards blow. If the stick ends up horizontal 
when it returns to its original height, what are the possible values for the 
maximum height to which the stick’s center rises? 

35. Striking a pool ball ** 

At what height should you horizontally strike a pool ball so that it immediately 
rolls without slipping? 

36. Doing work * 

(a) A pencil of mass m and length i lies at rest on a frictionless table. You 
push on it at its midpoint (perpendicular to it), with a constant force F 
for a time t. Find the final speed and the distance traveled. Verify that 
the work you do equals the final kinetic energy. 

(b) Assume that you apply the same F for the same t as above, but that 
you now apply it at one of the pencil’s ends (perpendicular to the pen¬ 
cil). Assume that t is small, so that the pencil doesn’t have much time 
to rotate. 5 Find the final CM speed, the final angular speed, and the 
distance your hand moves. Verify that the work you do equals the final 
kinetic energy. 

37. Repetitive bouncing * 

Using the result of Problem 19, what must the relation between v x and Rlu 
be so that the superball continually bounces back and forth between the same 
two points of contact on the ground? 

38. Bouncing under a table ** 

You throw a superball so that it bounces off the floor, then off the underside 
of a table, then off the floor again. What must the initial relation between v x 
and Ruj be so that the ball returns to your hand (with the return and outward 
paths the same)? Use the result of Problem 19, and modifications thereof. 6 

39. Bouncing between walls *** 

A stick of length t slides on frictionless ice. It bounces between two parallel 
walls, a distance L apart, in such a way that the same end touches both walls, 
and the stick hits the walls at an angle 6 each time. What is 9, in terms of L 
and £1 What does the situation look like in the limit L <C £1 

5 This means that you can assume that your force is always essentially perpendicular to the 
pencil, as far as the torque is concerned. 

6 You are strongly encouraged to bounce a ball in such a manner and have it magically come 
back to your hand. It turns out that the required value of w is small, so a natural throw with w w 0 
will essentially get the job done. 
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What should 9 be, in terms of L and £, if the stick makes an additional n full 
revolutions between the walls? What is the minimum value of L/l for which 
this is possible? 
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7.8 Problems 



Figure 7.40 



Figure 7.41 



Section 7.1: Pancake object in x-y plane 

1. Leaving the sphere ** 

A ball with moment of inertia ymr 2 rests on top of a fixed sphere. There is 
friction between the ball and the sphere. The ball is given an infinitesimal kick 
and rolls down without slipping. Assuming that r is much smaller than the 
radius of the sphere, at what point does the ball lose contact with the sphere? 
How does your answer change if the size of the ball is comparable to, or larger 
than, the size of the sphere? You may want to solve Problem 4.3 first, if you 
haven’t already done so. 

2. Sliding ladder *** 

A ladder of length £ and uniform mass density stands on a frictionless floor and 
leans against a frictionless wall. It is initially held motionless, with its bottom 
end an infinitesimal distance from the wall. It is then released, whereupon the 
bottom end slides away from the wall, and the top end slides down the wall 
(see Fig. 7.40). When it loses contact with the wall, what is the horizontal 
component of the velocity of the center of mass? 

3. Leaning rectangle *** 

A rectangle of height 2a and width 2b rests on top of a fixed cylinder of radius 
R (see Fig. 7.41). The moment of inertia of the rectangle around its center 
is I. The rectangle is given an infinitesimal kick, and then “rolls” on the 
cylinder without slipping. Find the equation of motion for the tilt angle of the 
rectangle. Under what conditions will the rectangle fall off the cylinder, and 
under what conditions will it oscillate back and forth? Find the frequency of 
these small oscillations. 

4. Mass in a tube *** 

A tube of mass M and length i is free to swing by a pivot at one end. A mass 
m is positioned inside the tube at this end. The tube is held horizontal and 
then released (see Fig. 7.42). Let 9 be the angle of the tube with respect to 
the horizontal, and let x be the distance the mass has traveled along the tube. 
Find the Euler-Lagrange equations for 9 and x, and then write them in terms 
of 9 and y = x/£ (the fraction of the distance along the tube). 

These equations can only be solved numerically, and you must pick a numerical 
value for the ratio r = m/M in order to do this. Write a program (see 
Appendix D) that produces the value of y when the tube is vertical (9 = 7r/2). 
Give this value of y for a few values of r. 


Figure 7.42 
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Section 7.3: Calculating moments of inertia 

5. Slick calculations of I ** 

In the spirit of Section 7.3.2, find the moments of inertia of the following 
objects (see Fig. 7.43). 

(a) A uniform square of mass m and side i (axis through center, perpendic¬ 
ular to plane). 

(b) A uniform equilateral triangle of mass m and side t (axis through center, 
perpendicular to plane). 

6. Slick calculations of I for fractal objects *** 

In the spirit of Section 7.3.2, find the moments of inertia of the following fractal 
objects. (Be careful how the mass scales.) 




Figure 7.43 


(a) Take a stick of length £, and remove the middle third. Then remove the 
middle third from each of the remaining two pieces. Then remove the 
middle third from each of the remaining four pieces, and so on, forever. 
Let the final object have mass m (axis through center, perpendicular to 
stick; see Fig. 7.44). 7 

(b) Take a square of side £, and remove the “middle” square (1/9 of the 
area). Then remove the “middle” square from each of the remaining 
eight squares, and so on, forever. Let the final object have mass m (axis 
through center, perpendicular to plane; see Fig. 7.45). 

(c) Take an equilateral triangle of side £. and remove the “middle” triangle 
(1/4 of the area). Then remove the “middle” triangle from each of the 
remaining three triangles, and so on, forever. Let the final object have 
mass m (axis through center, perpendicular to plane; Fig. 7.46). 

7. Minimum I 

A moldable blob of matter of mass M is to be situated between the planes 
z = 0 and z = 1 (see Fig. 7.47) so that the moment of inertia around the 
z-axis be as small as possible. What shape should the blob take? 


Figure 7.44 



Figure 7.46 


7 This object is the Cantor set, for those who like such things. It has no length, so the density 
of the remaining mass is infinite. If you suddenly develop an aversion to point masses with infinite 
density, simply imagine the above iteration being carried out only, say, a million times. 



Figure 7.47 
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A— 1 —A 


Figure 7.48 



Figure 7.49 



Figure 7.50 


Section 7-4- Torque 

8. Zero torque from internal forces ** 

Given a collection of particles with positions r j, let the force on the ith particle, 
due to all the others, be Fj nt . Assuming that the force between any two 
particles is directed along the line between them, use Newton’s third law to 
show that r i x FJ nt = 0. 

9. Removing a support * 

(a) A uniform rod of length i and mass m rests on supports at its ends. The 
right support is quickly removed (see Fig. 7.48). What is the force on 
the left support immediately thereafter? 

(b) A rod of length 2 r and moment of inertia qmr 2 rests on top of two 
supports, each of which is a distance d away from the center. The right 
support is quickly removed (see Fig. 7.48). What is the force on the left 
support immediately thereafter? 

10. Oscillating ball ** 

A small ball with radius r and uniform density rolls without slipping near the 
bottom of a fixed cylinder of radius R (see Fig. 7.49). What is the frequency 
of small oscillations? Assume 

11. Oscillating cylinders ** 

A hollow cylinder of mass M\ and radius R\ rolls without slipping on the 
inside surface of another hollow cylinder of mass M 2 and radius i? 2 - Assume 
R\ <C J? 2 - Both axes are horizontal, and the larger cylinder is free to rotate 
about its axis. What is the frequency of small oscillations? 

12. A triangle of cylinders *** 

Three identical cylinders with moments of inertia I = ijMR 2 are situated in a 
triangle as shown in Fig. 7.50. Find the initial downward acceleration of the 
top cylinder for the following two cases. Which case has a larger acceleration? 

(a) There is friction between the bottom two cylinders and the ground (so 
they roll without slipping), but there is no friction between any of the 
cylinders. 

(b) There is no friction between the bottom two cylinders and the ground, 
but there is friction between the cylinders (so they don’t slip with respect 
to each other). 
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13. Falling stick * 

A massless stick of length b has one end attached to a pivot and the other end 
glued perpendicularly to the middle of a stick of mass m and length £. 


_ b ^ 


(a) If the two sticks are held in a horizontal plane (see Fig. 7.51) and then 
released, what is the initial acceleration of the CM? 

(b) If the two sticks are held in a vertical plane (see Fig. 7.51) and then 
released, what is the initial acceleration of the CM? 



14. Lengthening the string ** 

A mass hangs from a massless string and swings around in a horizontal circle, 
as shown in Fig. 7.52. The length of the string is very slowly increased (or 
decreased). Let 9, £, r, and h be defined as shown. 

(a) Assuming 9 is very small, how does r depend on £? 

(b) Assuming 9 is very close to 7 t/ 2, how does h depend on £? 

15. Falling Chimney **** 

A chimney initially stands upright. It is given a tiny kick, and it topples over. 
At what point along its length is it most likely to break? 

In doing this problem, work with the following two-dimensional simplified 
model of a chimney. Assume that the chimney consists of boards stacked on 
top of each other, and that each board is attached to the two adjacent ones 
with tiny rods at each end (see Fig. 7.53). The goal is to determine which 
rod in the chimney has the maximum tension. (Work in the approximation 
where the width of the chimney is very small compared to its height.) 

Section 7.5: Collisions 

16. Ball hitting stick ** 

A ball of mass M collides with a stick with moment of inertia / = r/mi 2 
(relative to its center, which is its CM). The ball is initially traveling with 
velocity Vo perpendicular to the stick. The ball strikes the stick at a distance 
d from the center (see Fig. 7.54). The collision is elastic. Find the resulting 
translational and rotational speeds of the stick, and also the resulting speed 
of the ball. 

17. A ball and stick theorem ** 

Consider the setup in Problem 16. Show that the relative speed of the ball 
and the point of contact on the stick is the same before and immediately after 
the collision. (This result is analogous to the “relative speed” result for a 1-D 
collision, Theorem 4.3.) 


Figure 7.51 



Figure 7.52 



Figure 7.53 


I=r\ml 2 


M y 


Figure 7.54 
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Figure 7.55 


Section 7.6: Angular Impulse 

18. Sliding to rolling ** 

A ball initially slides without rotating on a horizontal surface with friction 
(see Fig. 7.55). The initial speed of the ball is Vo, and the moment of inertia 
about its center is I = ymR 2 . 

(a) Without knowing anything about the nature of the friction force, find 
the speed of the ball when it begins to roll without slipping. Also, find 
the kinetic energy lost while sliding. 

(b) Now consider the special case where the coefficient of kinetic friction is 
fi, independent of position. At what time, and at what distance, does the 
ball begin to roll without slipping? Verify that the work done by friction 
equals the energy loss calculated in part (a). (Be careful on this.) 


19. The superball ** 

A ball with radius R and / = (2/5 )mR 2 is thrown through the air. It spins 
around the axis perpendicular to the plane of the motion (call this the x- 
y plane). The ball bounces off a floor without slipping during the time of 
contact. Assume that the collision is elastic, and that the magnitude of the 
vertical v y is the same before and after the bounce. Show that v' x and u/ after 
the bounce are related to v x and u before the bounce by 


f < 

\ V 

3 -4\ 

( % ^ 

[ Rw' 

' 7 l 

-10 -3 ) 

1 ^ ) 


where positive v x is to the right, and positive u> is counterclockwise. 


20. Many bounces * 

Using the result of Problem 19, describe what happens over the course of many 
superball bounces. 


21. Rolling over a bump ** 

A ball with radius R and / = (2/5 )mR 2 rolls with speed Vo without slipping 
on the ground. It encounters a step of height h and rolls up over it. Assume 
that the ball sticks to the corner of the step briefly (until the center of the ball 
is directly above the corner). Show that if the ball is to climb over the step, 
then Vo must satisfy 

Vo > 
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22. Lots of sticks *** 

Consider a collection of rigid sticks of length 2r, masses m,. and moments of 
inertia rfiriir 2 , with mi > m 2 > m 3 > •••. The CM of each stick is located 
at the center. The sticks are placed on a horizontal frictionless surface, as 
shown in Fig. 7.56. The ends overlap a negligible distance, and the ends are 
a negligible distance apart. 

The first (heaviest) stick is given an instantaneous blow (as shown) which 
causes it to translate and rotate. The first stick will strike the second stick, 
which will then strike the third stick, and so on. Assume all the collisions are 
elastic. 

Depending on the size of //, the speed of the rath stick will either (1) approach 
zero, (2) approach infinity, or (3) be independent of ra, as ra —> 00 . Show that 
the special value of 77 corresponding to the third of these three scenarios is 
rj = 1/3, which happens to correspond to a uniform stick. 



Figure 7.56 
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7.9 Solutions 


1. Leaving the sphere 


In this setup, as in Problem 4.3, the ball leaves the sphere when the normal force 
becomes zero, that is, when 


—— = mg cos 9. 
R 


(7.61) 


The only change from the solution to Problem 4.3 comes in the calculation of v. The 
ball now has rotational energy, so conservation of energy gives mgR( 1 — cos 9) = 
mv 2 / 2 + IuJ 2 /2 = mo 2 /2 + ijmr 2 w 2 /2. But ru> = v, so we have 


- (1 + rj)mv 2 =mgR(l- cos 9) *4 i 

Plugging this into eq. (7.61), we see that the ball 1< 
2 

cos 9 = -. 

3 + T) 


2gR(l — cos 9) 

1 + 5? 

the sphere when 


(7.62) 


(7.63) 


Remarks: For g = 0, this equals 2/3, as in Problem 4.3. For a uniform ball with g = 2/5, 
we have cos 0 = 10/17, so 9 « 54°. For g — * oo (for example, a spool with a very thin axle 
rolling down the rim of a circle), we have cos 9 —> 0, so 6 ss 90°. This makes sense because 
v is always very small, because most of the energy takes the form of rotational energy. £ 


If the size of the ball is comparable to, or larger than, the size of the sphere, then we 
must take into account the fact that the CM of the ball does not move along a circle 
of radius R. Instead, it moves along a circle of radius R + r, so eq. (7.61) becomes 

= mg cos 9. (7-64) 

R + r 

Also, the conservation-of-energy equation takes the form, mg(R + r)(l — cos 0) = 
mv 2 / 2 + gmr 2 oj 2 / 2. But ruj still equals v (prove this), so the kinetic energy still 
equals (1 + g)mv 2 / 2. 8 We therefore have the same equations as above, except that 
R is replaced everywhere by R + r. But R didn’t appear in the result for 9 in eq. 
(7.63), so the answer is unchanged. 


Remark: Note that the method of the second solution to Problem 4.3 will not work in this 
problem, because there is a force available to make v x decrease, namely the friction force. 
And indeed, v x does decrease before the rolling ball leaves the sphere. The v in the present 
problem is simply 1/yT + g times the v in Problem 4.3, so the maximum v x is still achieved 
at cos 0 = 2/3. But the angle in eq. (7.63) is larger than this. (However, see Problem 2 for 
a setup involving rotations where the max v x is relevant.) $ 

2. Sliding ladder 

The important point to realize in this problem is that the ladder loses contact with 
the wall before it hits the ground. Let’s find where this loss of contact occurs. 

Let r = i/ 2, for convenience. While the ladder is in contact with the wall, its CM 
moves in a circle of radius r. This follows from the fact that the median to the 
hypotenuse of a right triangle has half the length of the hypotenuse. Let 9 be the 

8 In short, the ball can be considered to be instantaneously rotating around the contact point, 
so the parallel-axis theorem leads to the factor of (1 + g) in the rotational kinetic energy around 
this point. 
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angle between the wall and the radius from the corner to the CM; see Fig. 7.57. This 
is also the angle between the ladder and the wall. 

We’ll solve this problem by assuming that the CM always moves in a circle, and then 
determining the position at which the horizontal CM speed starts to decrease, that 
is, the point at which the normal force from the wall would have to become negative. 
Since the normal force of course can’t be negative, this is the point where the ladder 
loses contact with the wall. 

By conservation of energy, the kinetic energy of the ladder equals the loss in potential 
energy, which is mgr( 1 — cos 9). This kinetic energy may be broken up into the CM 
translational energy plus the rotation energy. The CM translational energy is simply 
mr 2 9 2 / 2, because the CM travels in a circle of radius r. The rotational energy is 
I9 2 /2. The same 9 applies here as in the CM translational motion, because 9 is the 
angle between the ladder and the vertical, and thus is the angle of rotation of the 
ladder. 

Letting I = r)mr 2 to be general (g 1/3 for our ladder), the conservation of energy 
statement is (l + rj)mr 2 9 2 /2 = mgr(l — cos 9). Therefore, the speed of the CM, which 
is v = r9, equals 



The horizontal component of this is 


>s 9) cos 9. 


(7.66) 


Taking the derivative of ■y/j(T— cos 9) cos 9, we see that the horizontal speed is maxi¬ 
mum when cos 9 = 2/3. Therefore the ladder loses contact with the wall when 


cos 9 = 


2 

3 


(7.67) 


Note that this is independent of g. This means that, for example, a dumbbell (two 
masses at the ends of a massless rod, with g = 1) will lose contact with the wall at 
the same angle. 


Plugging this value of 9 into eq. (7.66), and using g = 1/3, 
speed of 

= \/Wr = Vg£ 

x '% - 3 


obtain a final horizontal 


(7.68) 


Note that this is 1/3 of the \j2gr horizontal speed that the ladder would have if it 
were arranged (perhaps by having the top end slide down a curve) to eventually slide 
horizontally along the ground. 

You are encouraged to compare various aspects of this problem with those in Problem 
1 and Problem 4.3. 


Remark: The normal force from the wall is zero at the start and finish, so it must reach a 
maximum at some intermediate value of 9. Let’s find this 9. Taking the derivative of v x in 
eq. (7.66) to find the CM’s horizontal acceleration a x , and then using 9 oc Iplg— cos 9 from 
eq. (7.65), we see that the force from the wall is proportional to 



Figure 7.57 


.9(3 cos 9 — 2). 


(7.69) 





Figure 7.58 
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Taking the derivative of this, we find that the force from the wall is maximum when 


* (7.70) 


3. Leaning rectangle 

We must first find the position of the rectangle’s CM when it has rotated through an 
angle 9. Using Fig. 7.58, we can obtain this position (relative to the center of the 
cylinder) by adding up the distances along the three shaded triangles. Note that the 
contact point has moved a distance R0 along the rectangle. We find that the position 
of the CM is 


(x,y) = R(sm0,cose) + R0(-cos0,sm0) + a{sm9,cos0), (7.71) 

We’ll now use the Lagrangian method to find the equation of motion and the frequency 
of small oscillations. Using eq. (7.71), you can show that the square of the speed of 
the CM is 

v 2 = x 2 + y 2 = (a 2 + R 2 9 2 )9 2 . (7.72) 

Remark: The simplicity of this result suggests that there is a quicker way to obtain it. 
And indeed, the CM instantaneously rotates around the contact point with angular speed 
9, and from Fig. 7.58 the distance to the contact point is \/a 2 + R 2 9 2 . Therefore, the speed 
of the CM is ojr = 9sJa 2 + R?9 2 . * 

The Lagrangian is 

C = T-V= ^ m(a 2 + R 2 9 2 )9 2 + ^ I9 2 - mg((R + a ) cos 6 + R9sm9^j . (7.73) 

The equation of motion is (as you can show) 

( ma 2 + mR 2 0 2 + 1)9 + mR 2 99 2 = mga sin 9 — mgR6 cos 9. (7.74) 

Let us now consider small oscillations. Using the small-angle approximations, sin 9 ~ 9 
and cos 9 ~1- 9 2 /2 , and keeping terms only to first order in 9, we obtain 

(ma 2 + 1)9 + mg(R - a)0 = 0. (7.75) 

The coefficient of 9 is positive if a < R. Therefore, oscillatory motion occurs for a < R. 
Note that this condition is independent of b. The frequency of small oscillations is 

(7.76) 

Remarks: Let’s look at some special cases. If I = 0 (that is, all of the rectangle’s mass is 
located at the CM), we have w!■= \Jg{R — a)/a 2 . If in addition a <C R, then w » y/gR/a 2 . 
You can also derive these results by considering the CM to be a point mass sliding in a 
parabolic potential. If the rectangle is instead a uniform horizontal stick, so that a <S R, 
a -C b, and I « mb 2 /3. then we have u> « \J'3gR/b 2 . If the rectangle is a vertical stick 
(satisfying a < R), so that 6<o and I » ma 2 / 3, then we have w « ^3 g(R — a) /4a 2 . If in 
addition fl«E, then ui « ^J'igR/Aa 2 . 

Without doing much work, there are two other ways we can determine the condition under 
which there is oscillatory motion. The first is to look at the height of the CM. Using small- 
angle approximations in eq. (7.71), the height of the CM is y « (R + a) + (R — a)9 2 / 2. 
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Therefore, if a < R, the potential energy increases with 9, so the rectangle wants to decrease 
its 6 and fall back down to the middle. But if a > R, the potential energy decreases with 9, 
so the rectangle wants to increase its 9 and fall off the cylinder. 

The second way is to look at the horizontal positions of the CM and the contact point. 
Small-angle approximations in eq. (7.71) show that the former equals a9 and the latter 
equals R9. Therefore, if a < R then the CM is to the left of the contact point, so the torque 
from gravity (relative to the contact point) makes 9 decrease, and the motion is stable. But 
if a > R then the torque from gravity makes 9 increase, and the motion is unstable. £ 

4. Mass in a tube 
The Lagrangain is 




where r = m/M and g = g/I. Below is a Maple program that numerically finds the 
value of r] when 9 equals n/2, in the case where r = 1. As mentioned in Problem 2 
in Appendix B, this value of r) does not depend on g or £, and hence not g. In the 
program, we’ll denote g by g, which we’ll give the arbitrary value of 10. We’ll use 
q for 6, and n for r). Also, we denote 9 by ql and 9 by q2, etc. Even if you don’t 
know Maple, this program should still be understandable. See Appendix D for more 
discussion on solving differential equations numerically. 


n:=0: # initial n value 

nl:=0: # initial n speed 

q:=0: # initial angle 

ql:=0: # initial angular speed 

e:=.0001: # small time interval 

g:=10: # value of g/1 

r:=l: # value of m/M 

while q<l.57079 do # do this process until the angle is pi/2 
n2:=n*ql~2+g*sin(q): # the first E-L equation 

q2:=((3*r*g*n+3*g/2)*cos(q)-6*r*n*nl*ql)/(l+3*r*n~2): 

# the second E-L equation 
n:=n+e*nl: # how n changes 

nl:=nl+e*n2: # how nl changes 
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q:=q+e*ql: # how q changes 

ql:=ql+e*q2: # how ql changes 

end do: # stop the process 

n; # print the value of n (eta) 

The resulting value for 77 is 0.378. If you actually run this program on Maple with 
different values of g, you will find that the result for n doesn’t depend on g, as we 
stated above. A few results for 77 for various values of r are, in (r, rj) notation: (0, .349), 
(1, .378), (2, .410), (10, .872), (20,3.290). It turns out that r w 11.25 yields 77 m 1. 
That is, the mass m gets to the end of the tube right when the tube becomes vertical. 
For 77 values larger than 1, we could imagine attaching a massless tubular extension 
on the end of the given tube. It turns out that 77 —> 00 as r —> 00 . In this case, the 
mass m essentially drops straight down, causing the tube to quickly swing down to 
a nearly vertical position. But m ends up slightly to one side, and then takes a very 
long time to move over to become directly below the pivot. 

5. Slick calculations of I 

(a) We claim that the I for a square of side 2£ is 16 times the I for a square of side £, 
assuming that the axes pass through any two corresponding points. This is true 
because dm goes like the area, which is proportional to length squared, so the 
corresponding dm’s differ by a factor of 4. And then there are the two powers 
of r in the integrand. Therefore, when changing variables from one square to 
the other, there are four powers of 2 in the integral / r 2 dm = f r 2 pdxdy. 

As is Section 7.3.2, we can express the relevant relations in terms of pictures: 

= i<s Q 

-d 

□ = 0+"(vf) 2 

The first line comes from the scaling argument, the second comes from the fact 
that moments of inertia simply add, and the third comes from the parallel-axis 
theorem. Equating the right-hand sides of the first two, and then using the third 
to eliminate □ gives 



This agrees with the result of Example 12 in Section 7.3.1, with a = b = l. 

(b) This is again a two-dimensional object, so the I for a triangle of side 2£ is 16 
times the I for a triangle of side l, assuming that the axes pass through any two 
corresponding points. With pictures, we have: 
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A = ^ ! <'N 

•D> = A + Kw) 2 

The first line comes from the scaling argument, the second comes from the fact 
that moments of inertia simply add, and the third comes from the parallel-axis 
theorem. Equating the right-hand sides of the first two, and then using the third 
to eliminate •r> gives 



This agrees with the result of Example 11 in Section 7.3.1, with N = 3. The 
“radius” R used in that example equals £/\/3 in the present notation. 

6. Slick calculations of I for fractal objects 

(a) The scaling argument here is a little trickier than that in Section 7.3.2. Our 
object is self-similar to an object 3 times as big, so let’s increase the length by 
a factor of 3 and see what happens to I. In the integral / x 2 dm, the x’s pick 
up a factor of 3, so this gives a factor of 9. But what happens to the dm? Well, 
tripling the size of our object increases its mass by a factor of 2, because the 
new object is simply made up of two of the smaller ones, plus some empty space 
in the middle. So the dm picks up a factor of 2. Therefore, the I for an object 
of length 3£ is 18 times the I for an object of length t, assuming that the axes 
pass through any two corresponding points. 

With pictures, we have (the following symbols denote our fractal object): 

37 i8 1 


= 2 (-" 


-) 


•-= + ml 2 

The first line comes from the scaling argument, the second comes from the fact 
that moments of inertia simply add, and the third comes from the parallel-axis 
theorem. Equating the right-hand sides of the first two, and then using the third 
to eliminate • — — gives 
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This is larger than the I for a uniform stick, namely mi 2 /12, because the mass 
here is generally farther away from the center. 

Remark: When we increase the length of our object by a factor of 3 here, the factor 
of 2 in the dm is larger than the factor of 1 relevant to a zero-dimensional object, but 
smaller than the factor of 3 relevant to a one-dimensional object. So in some sense our 
object has a dimension between 0 and 1. It is reasonable to define the dimension, d, of 
an object as the number for which r d is the increase in “volume” when the dimensions 
are increased by a factor of r. In this problem, we have 3 d = 2, so d = log 3 2 « 0.63. 
* 

(b) Again, the mass scales in a strange way. Let’s increase the dimensions of our 
object by a factor of 3 and see what happens to I. In the integral / x 2 dm, the 
x’s pick up a factor of 3, so this gives a factor of 9. But what happens to the 
dm? Tripling the size of our object increases its mass by a factor of 8, because 
the new object is made up of eight of the smaller ones, plus an empty square 
in the middle. So the dm picks up a factor of 8. Therefore, the / for an object 
of side M is 72 times the I for an object of side t, assuming that the axes pass 
through any two corresponding points. 

With pictures, we have (the following symbols denote our fractal object): 

3/ 

/ 

• = 72 E 


• =4 M. ) 

. = .| + ml 1 

= . + m(L2lf 

The first line comes from the scaling argument, the second comes from the fact 
that moments of inertia simply add, and the third and fourth come from the 
parallel-axis theorem. Equating the right-hand sides of the first two, and then 
using the third and fourth to eliminate . and _ gives 



This is larger than the I for the uniform square in Problem 5, namely mi 2 /6 , 
because the mass here is generally farther away from the center. 

Note: Increasing the size of our object by a factor of 3 increases the “volume” by a 
factor of 8. So the dimension is given by 3 d = 8 ==> d = log 3 8 « 1.89. 

(c) Again, the mass scales in a strange way. Let’s increase the dimensions of our 
object by a factor of 2 and see what happens to I. In the integral f x 2 dm, the 
ar’s pick up a factor of 2, so this gives a factor of 4. But what happens to the 
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dm? Doubling the size of our object increases its mass by a factor of 3, because 
the new object is simply made up of three of the smaller ones, plus an empty 
triangle in the middle. So the dm picks up a factor of 3. Therefore, the I for 
an object of side 2 £ is 12 times the I for an object of side £, assuming that the 
axes pass through any two corresponding points. 

With pictures, we have (the following symbols denote our fractal object): 

21 


A 


The first line comes from the scaling argument, the second comes from the fact 
that moments of inertia simply add, and the third comes from the parallel-axis 
theorem. Equating the right-hand sides of the first two, and then using the third 
to eliminate • gives 





This is larger than the / for the uniform triangle in Problem 5, namely ml 2 /12, 
because the mass here is generally farther away from the center. 

Note: Note: Increasing the size of our object by a factor of 2 increases the “volume” 
by a factor of 3. So the dimension is given by 2 d = 3 d = log 2 3 « 1.58. 

7. Minimum I 

The shape should be a cylinder with the 2 -axis as its symmetry axis. A quick proof 
(by contradiction) is as follows. 

Assume that the optimal blob is not a cylinder, and consider the surface of the blob. 
If the blob is not a cylinder, then there exist two points on the surface, Pi and P 2 , 
that are located at different distances, ri and r 2 , from the 2 -axis. Assume ri < r -2 
(see Fig. 7.59). If we move a small piece of the blob from P 2 to Pi, then we decrease 
the moment of inertia, f r 2 p dV. Therefore, the proposed non-cylindrical blob cannot 
be the one with the smallest I. 

In order to avoid this contradiction, all points on the surface must be equidistant from 
the z-axis. The only blob with this property is a cylinder. 

8. Zero torque from internal forces 

Let F”)* be the force that the ith particle feels from the jth particle (see Fig. 7.60). 
Then __ 

F int = ^ F in t _ (7.80) 



Figure 7.59 



Figure 7.60 
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l 



Figure 7.61 


The total internal torque, relative to the chosen origin, is therefore 

r int = E x F * nt = E E r * x • ( 7 - 81 ) 

But if we interchange the indices (which were labeled arbitrarily), we have 

^ int = EE r i xF “ = -EE r i x F «> ( 7 - 82 ) 

where we have used Newton’s third law, F’T = —F'T. Adding the two previous 
equations gives 

^ int = EE( r *- r } ) xF T (7- 83 ) 

i 3 

But F"-* is parallel to (r, — r 7 ), by assumption. Therefore, each cross product in the 
sum equals zero. 

The above sums might make this solution look a bit involved. But the idea is simply 
that the torques cancel in pairs. This is clear from Fig. 7.60, because the two forces 
shown are equal and opposite, and they have the same lever arm relative to the origin. 

9. Removing a support 

(a) First Solution: Let the desired force on the left support be F, and let the 
downward acceleration of the stick’s CM be a. Then the F = ma and t = la 
(relative to the fixed support; see Fig. 7.61) equations, along with the circular- 
motion relation between a and a, are 

mg- F 

t 

mg 2 



d 



Figure 7.62 


The second equation gives a = 3g/2£. The third equation then gives a = 3g/4. 
And the first equation then gives F = mg/4. Note that the right end of the 
stick accelerates at 2 a = 3g/2, which is larger than g. 

Second Solution: Looking at torques around the CM, we have 


And looking at torques around the fixed support, we have 


Dividing the first of these equations by the second gives F = mg/ 4. 

(b) First Solution: As in the first solution above, we have (using the parallel-axis 
theorem; see Fig. 7.62) 

mg — F = ma, 

mgd = (gmr 2 + md 2 )a 
a = da. 


(7.87) 
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Solving for F gives F = mg/( 1 + d 2 /gr 2 ). For d = r and g = 1/3, we obtain 
the answer in part (a). 

Second Solution: As in the second solution above, looking at torques around 
the CM, we have 

Fd = (gmr 2 )a. (7.88) 

And looking at torques around the fixed support, we have 

mgd = ( gmr 2 + md 2 )a. (7.89) 


Dividing the first of these equations by the second gives F = mg/( 1 + d 2 /gr 2 ). 


Some limits: For the special case d = r, we have the following: If g = 0 then F = 0; 
if g = 1 then F = mg/2-, and if g = oo (we could put masses at the ends of massless 
extensions of the stick) then F = mg-, these all make intuitive sense. In the limit d = 0, 
we have' F — mg. And in the limit d = oo, we have F = 0. Technically, we should be 
writing d <€. y/gr or d » ^fgr here. 

10. Oscillating ball 

Let the angle from the bottom of the cylinder to the ball be 0 (see Fig. 7.63), and 
let Ff be the friction force. Then the tangential F = ma equation is 

F f — mg sin 9 = ma, (7.90) 

where we have chosen rightward to be the positive direction for a and Ff. Also, the 
t = la equation (relative to the CM) is 


R 


Figure 


—rFf = 2 -mr 2 a , (7.91) 

where we have chosen clockwise to be the positive direction for a. Using ra = a, the 
torque equation becomes Ff = —(2/5)mo. Plugging this into eq. (7.90), and using 
sin# w 9, we obtain mgO + (7/5 )ma = 0. Under the assumption r <C R, we have 
a w R0, so we arrive at 

§+ {^h) e=0 - (7 - 92) 

This is the equation for simple harmonic motion with frequency 


u = 


[W 

V 7 R 


(7.93) 


This answer is slightly smaller than the *J.g/R answer for the case where the ball slides. 
The rolling ball effectively has a larger inertial mass, but the same gravitational mass. 
You can also solve this problem by using the contact point as the origin around which 
r and L are calculated. 


Remarks: If we omit the r <C R assumption, you can show that ra = a still holds, 
but a = R9 is replaced by a = (R — r)9. Therefore, the exact result for the frequency is 
tv = yJhg/7(R — r). This goes to infinity as r —> R. 

In general, if the ball has a moment of inertia equal to gmr 2 , you can show that the frequency 
of small oscillations equals \fg/{ 1 + g)R. 
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Figure 7.64 



Figure 7.65 


11. Oscillating cylinders 

The moments of inertia of the cylinders are simply I\ = M\ R'j and I 2 = Let 

F be the force between the two cylinders. And let 0\ and 62 be the angles of rotation 
of the cylinders (with counterclockwise positive), relative to the position where the 
small cylinder is at the bottom of the big cylinder. Then the torque equations are 

FRi = MiR\9i, 

FR 2 = -M 2 R\e 2 . (7.94) 


We are not so much concerned with 0-\ and 62 as we are with the angular position that 
Mi makes with the vertical. Call this angle 9 (see Fig. 7.64). In the approximation 
Ri 7?2, the non-slipping condition says that R 26 » i? 2#2 — R\9\. Eqs. (7.94) then 


give 


(7.95) 


The force equation on Mi is 


F — M-, g sin 0 = M 1 (R 2 6 ). 


(7.96) 


Substituting the F from (7.95) into this gives (with sin 6 ~ 6 ) 



(Mig 

V R2 


9 = 0 . 


The frequency of small oscillations is therefore 


l~g~ I Mi + M 2 

V j?2 V Ml + 2M2 


(7.97) 


(7.98) 


Remarks: In the limit M2 -C Ml, we obtain uj « yfgjlh- There is essentially no friction 
force between the cylinders; only a normal force. So the small cylinder essentially acts like a 
pendulum of length R2. In the limit Mi <C M2, we obtain u> « yJg/2R2. The large cylinder 
is essentially fixed, so we simply have the setup mentioned in the remark in the solution to 
Problem 10, wit h »} = 1. Jft 

12. A triangle of circles 

(a) Let N be the normal force between the cylinders, and let F be the friction force 
from the ground (see Fig. 7.65). Let a x be the initial horizontal acceleration of 
the right bottom cylinder (so a = a x /R is its angular acceleration), and let a y 
be the initial vertical acceleration of the top cylinder (with downward taken to 
be positive). 

If we consider the torque around the center of one of the bottom cylinders, then 
the only relevant force is F, because N, gravity, and the normal force from the 
ground all point through the center. The equations expressing F x = Ma x on 
the bottom right cylinder, F y = Ma y on the top cylinder, and r = la on the 
bottom right cylinder are, respectively, 

ZVcos60°— F = Ma x , 

Mg -2N sin 60° = Ma y , 

FR = ( r/MR 2 )(a x /R ). (7.99) 
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We have four unknowns, N, F, a x , and a y . So we need one more equation. 
Fortunately, a x and a y are related. The contact surface between the top and 
bottom cylinders lies (initially) at an angle of 30° with the horizontal. Therefore, 
if the bottom cylinders move a distance d to the side, then the top cylinder moves 
a distance d tan 30° downward. Hence, 

a x = V3a y . (7.100) 


We now have four equations in four unknowns. Solving for a y , by your method 
of choice, gives 


a y 


9 

7 + 6 ?? ' 


(7.101) 


(b) Let N be the normal force between the cylinders, and let F be the friction force 
between the cylinders (see Fig. 7.66). Let a x be the initial horizontal acceleration 
of the right bottom cylinder, and let a y be the initial vertical acceleration of 
the top cylinder (with downward taken to be positive). Let a be the angular 
acceleration of the right bottom cylinder (with counterclockwise taken to be 
positive). Note that a. is not equal to a x /R, because the bottom cylinders slip 
on the ground. 

If we consider the torque around the center of one of the bottom cylinders, then 
the only relevant force is F. And from the same reasoning as in part (a), we 
have a x = \/3a y . Therefore, the four equations analogous to eqs. (7.99) and 
(7.100) are 


N cos 60° — F sin 60° 
Mg — 2N sin 60° — 2Fcos60° 
FR 


Ma x , 

Ma y , 

(i rjMR 2 )a , 

VSdy. 


(7.102) 


We have five unknowns, N, F, d x , a y , and a. So we need one more equation. 
The tricky part is relating a to d x . One way to do this is to ignore the y motion 
of the top cylinder and imagine the bottom right cylinder to be rotating up and 
around the top cylinder, which is held fixed. If the bottom cylinder moves an 
infinitesimal distance d to the right, then its center moves a distance d/ cos 30° 
up and to the right. So the angle through which the bottom cylinder rotates 
is (d/cos30°)/i? = (2/\/3)(d/R). Bringing back in the vertical motion of the 
cylinders does not change this result. Therefore, 


a = 


J2_a* 


(7.103) 


We now have five equations and five unknowns. Solving for d y , by your method 
of choice, gives 


9 

7 + 8r]' 


(7.104) 


Remarks: If rj = 0, that is, if all the mass is at the center of the cylinders, 9 then the 
results in both parts (a) and (b) reduce to g/7. If y ^ 0, then the result in part (b) is 
smaller than that in part (a). This isn’t so obvious, but the basic reason is that the 
bottom cylinders in part (b) take up more energy because they have to rotate slightly 
faster, because a = (2/\/3 ){a x /R) instead of a = a x /R. Jit 

9 The r) = 0 case is also equivalent to all the surfaces simply being frictionless, because then 
nothing rotates. 



Figure 7.66 
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13. Falling stick 


(a) Let’s calculate r and L relative to the pivot point. The torque is due to gravity, 
which effectively acts on the CM and has magnitude mgb. The moment of inertia 
of the stick around the horizontal axis through the pivot (and perpendicular to 
the massless stick) is simply mb 2 . So when the stick starts to fall, the r = dL/dt 
equation is mgb = ( mb 2 )a. Therefore, the initial acceleration of the CM, namely 
ba, is 

ba = g, (7.105) 

which is independent of £ and b. This answer makes sense. The stick initially 
falls straight down, and the pivot provides no force because it doesn’t know right 
away that the stick is moving. 

(b) The only change from part (a) is the moment of inertia of the stick around 
the horizontal axis through the pivot (and perpendicular to the massless stick). 
From the parallel-axis theorem, this moment is mb 2 +m£ 2 / 12. So when the stick 
starts to fall, the r = dL/dt equation is mgb = (mb 2 + nil 2 /12) a. Therefore, 
the initial acceleration of the CM is 


ba = 


9 

l + (£ 2 /12b 2 ) 


(7.106) 


For £ -C b, this goes to g, as it should. And for £ b, it goes to zero, as it 
should. In this case, a tiny movement of the CM corresponds to a very large 
movement of the points far out along the stick. Therefore, by conservation of 
energy, the CM must be moving very slowly. 


14. Lengthening the string 

Consider the angular momentum L relative to the support point P. The forces on the 
mass are the tension in the string and gravity. The former provides no torque around 
P, and the latter provides no torque in the ^-direction. Therefore, L z is constant. If 
we let u>e be the frequency of the circular motion when the string has length £, then 
we can say that 

L z = mr 2 cj e (7.107) 

is constant. 

The frequency ui( can be obtained by using F = ma for the circular motion. The 
tension in the string is mg/ cos 0 (to make the forces in the y-direction cancel), so the 
horizontal radial force is mg tan 9. Therefore, 

mg tan 6 = mrJj = m(£ sin 9')J\ => (j e =, ^ =,/^. (7.108) 

V £cos9 V h 

Plugging this into eq. (7.107), we see that the constant value of L z is 

L z = mr 2 ^ . (7.109) 

Let’s now look at the two cases. 

(a) For 0 « 0, we have h~ £, so eq. (7.109) says that r 2 /\fl is constant. Therefore, 
r oc £ 1/4 , (7.110) 

which means that r grows very slowly with l. 
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(b) For 9 ~ 7t/ 2, we have r ~ so eq. (7.109) says that £ 2 /Vh is constant. 
Therefore, 

hoc £ 4 , (7.111) 

which means that h grows very quickly with £. 

Note that eq. (7.109) says that h oc r 4 for any value of 9. So if you slowly 
lengthen the string so that r doubles, then h increases by a factor of 16. 

15. Falling Chimney 

Before we start dealing with the forces in the rods, let’s first determine 9 as a function 
of 9 (the angle through which the chimney has fallen). Let l be the height of the 
chimney. Then the moment of inertia around the pivot point on the ground is mi 2 / 3 
(if we ignore the width), and the torque (around the pivot point) due to gravity is 
r = mg(£/2) sin0. Therefore, r = dL/dt gives mg(£/ 2) sin# = ( l/3)m£ 2 9 , or 


(7.112) 


Let’s now determine the forces in the rods. Our strategy will be to imagine that the 
chimney consists of a chimney of height h , with another chimney of height l — h placed 
on top of it. We’ll find the forces in the rods connecting these two “sub-chimneys,” 
and then we’ll maximize one of these forces (T2, defined below) as a function of h. 
The forces on the top piece are gravity and also the forces from the two rods at 
each end of the bottom board. Let’s break these latter forces up into transverse 
and longitudinal forces along the chimney. Let Ti and T 2 be the two longitudinal 
components, and let F be the sum of the transverse components, as shown in Fig. 7.67. 
We have picked the positive directions for T- t and T 2 so that positive Ti corresponds 
to a compression in the left rod, and positive T 2 corresponds to a tension in the right 
rod (which is what the forces will turn out to be, as we’ll see). It turns out that if the 
width (which we’ll call 2r) is much less than the height, then T 2 F (as we will see 
below), so the tension in the right rod is essentially equal to T 2 . We will therefore be 
concerned with maximizing T 2 . 

In writing down the force and torque equations for the top piece, we have three 
equations (the radial and tangential F = ma equations, and r = dL/dt around the 
CM), and three unknowns (F, Ti, and T 2 ). If we define the fraction / = h/£, then 
the top piece has length (1 — f)£ and mass (1 — f)m, and its CM travels in a circle of 
radius (1 + f)£/ 2, Therefore, our three force and torque equations are, respectively, 


T 2 — Ti + (1 — f)mg cos 9 
F + (1 — f)mg sin 9 


(Ti + T 2 )r 


„(W)< 


(W)»(^>t 

(W)«(^)e, 

(l-/)m( (1 ~^ a<a )». (7.113) 


At this point, we could plow forward and solve this system of three equations in 
three unknowns. But things simplify greatly in the limit where r -C £. The third 
equation says that Ti + T 2 is of order 1/r, and the first equation says that T 2 — Ti is 
of order 1. These imply that Ti « T 2 , to leading order in 1/r. Therefore, we may set 
Ti + T 2 ss 2T 2 in the third equation. Using this approximation, along with the value 



Figure 7.67 
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of 0 from eq. (7.112), the second and third equations become 
3 

F + (1 — f)mg sin 9 = -(1 — f 2 )mg sin0, 

2 rT^-F^—^- = ffmglAnO. (7.114) 

This first of these equations gives 

F=^(- I + 4 /-3A (7.115) 

and then the second gives 

8 r 

As stated above, this is much greater than F (because £/r 1), so the tension in the 

right rod is essentially equal to T 2 . Taking the derivative of T 2 with respect to /, we 
see that it is maximum at 

< 7117 > 

Therefore, the chimney is most likely to break at a point one-third of the way up 
(assuming that the width is much less than the height). Interestingly, / = 1/3 makes 
the force F in eq. (7.115) exactly equal to zero. 

16. Ball hitting stick 

Let V, v, and u> be the speed of the ball, the speed of the stick’s CM, and the angular 
speed of the stick, respectively, after the collision. Then conservation of momentum, 
angular momentum (around the fixed point that coincides with the initial center of 
the stick), and energy give 

MV 0 = MV + mv, 

MV 0 d = MVd+gmPu, 

MVf = MV 2 + mv 2 +rimfiu 2 . (7.118) 


We must solve these three equations for V, v, and u. The first two equations quickly 
give vd = t]1 2 uj. Solving for V in the first equation and plugging the result into the 
third, and then eliminating u> through vd = rj£ 2 u> gives 


2Vq 


1 + T& 


^ = ^0 


z W Vo 


M ' r rii 2 1 + M + r)P 

Having found v, the first equation above gives V as 

1-4SM 


V = Vo¬ 


id 


m ~ iit 2 
. m , di ■ 


You are encouraged to check various limits of these answers. 


(7.119) 


(7.120) 


Remark: Another solution to eqs. (7.118) is of course V = Vo, V = 0, and w = 0. The 
initial conditions certainly satisfy conservation of p, L, and E with the initial conditions. A 
fine tautology, indeed. Nowhere in eqs. (7.118) does it say that the ball actually hits the 
stick. X 
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17. A ball and stick theorem 

As in the solution to Problem 16, we have 

MV 0 = MV + mv, 

MV 0 d = MVd + Ico, 

MVq = MV 2 + mv 2 + loo 2 . (7.121) 

The speed of the contact point on the stick right after the collision equals the speed 
of the CM plus the rotational speed relative to the CM. In other words, it equals 
v + cod. The desired relative speed is therefore (v + cod) — V. We can determine the 
value of this relative speed by solving the above three equations for V, v, and co. Or 
equivalently, we can just use the results of Problem 16. There is, however, a much 
more appealing method, which is as follows. 

The first two equations quickly give mvd = loo. The last equation may then be 
written as, using loo 2 = (Ioo)oo = ( mvd)oo, 

M(V 0 -V)(V 0 + V) = mv(v + cod). (7.122) 

If we now write the first equation as 

M(V 0 -V) = mv , (7.123) 

we can divide eq. (7.122) by eq. (7.123) to obtain Vq + V = v + cod, or 

V 0 = (v + cod) - V, (7.124) 

as we wanted to show. In terms of velocities, the correct statement is that the final 
relative velocity is the negative of the initial relative velocity. In other words, Vo — 0 = 
-(V-(v + ood)). 

18. Sliding to rolling 


(a) Define all linear quantities to be positive to the right, and all angular quantities 
to be positive clockwise, as shown in Fig. 7.68. Then, for example, the friction 
force Ff is negative. The friction force slows down the translational motion and 
speeds up the rotational motion, according to 


F f = ma, 

—F f R = la, (7.125) 

where we have calculated the torque relative to the CM. Eliminating Ff, and 
using I = rjmR 2 , gives a = —r]Ra. Integrating this over time, up to the time 
when the ball stops slipping, gives 

AV = -rjRAco. (7.126) 


Note that we could have obtained this by simply using the impulse equation, 
eq. (7.58). Using AU = U/ — Vo, and A co = oof —ooo = cof, and also co/ = Vf/R 
(the non-slipping condition), eq. (7.126) gives 


Vf = — 5 

1 1 + V 


(7.127) 



Figure 7.68 
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independent of the nature of Ff. Ff can depend on position, time, speed, or 
anything else. The relation a = —riRa, and hence also eq. (7.126), will still be 
true at all times. 


Remark: We can also calculate r and L relative to a dot painted on the ground that 
is the contact point at a given instant. There is zero torque relative to this point. 
To find L, we must add the L of the CM and the L relative to the CM. Therefore, 
t = dL/dt gives 0 = ( d/dt)(mRv + gmR 2 uj), and so a = —gRa, as above. & 


Using eq. (7.127), and also the relation Wf = Vf/R, the loss in kinetic energy is 


ARE 


\ mV S ~ (l mV f + 

2 mV ° 0 ~ (TT^F ~ C i+w ) 

-mV. 2 ( 


(7.128) 


For r) —> 0, no energy is lost, which makes sense. And for tj —> oo, all the energy 
is lost, which also makes sense. This case is essentially like a sliding block which 
can’t rotate. 


(b) Let’s first find t. The friction force is Ff = —gmg , so F = ma gives —gg = a. 
Therefore, AU = at = —ggt. But eq. (7.127) says that AU = Vf — Vo = 
—Vor]/(l + r)). Therefore, 


(1 + rj) ng ' 


(7.129) 


For g —> 0, we have t —> 0, which makes sense. And for rj oc, we have 
t —> Vo/ijig) which is exactly the time a sliding block would take to stop. 

Let’s now find d. We have d = Vot + (1/2)at?. Using a = —gg, and plugging in 
the t from eq. (7.129), we obtain 


rj(2 + ri) U 0 2 
(1 + r?) 2 2gg ' 


(7.130) 


The two extreme cases for r) check here. 

To calculate the work done by friction, we might be tempted to write down 
the product Ffd, with Ff = —jirng and d given in eq. (7.130). But the result 
doesn’t look much like the loss in kinetic energy calculated in eq. (7.128). 
What’s wrong with this reasoning? The error is that the friction force does not 
act over a distance d. To find the distance over which Ff acts, we must find 
how far the surface of the ball moves relative to the ground. 

The speed of a dot on the ball that is instantaneously the contact point is 
U re i = V(t) — Ru>(t) = (Vo + at) — Rat. Using a = —a/r\R and a = —gg, this 
becomes 

u rel = Vo - W^-ggt. (7.131) 

*7 

Integrating this from t = 0 to the t given in eq. (7.129) gives 


^rel 


2gg(l + 7]) ' 


(7.132) 


The work done by friction is Ffd re \ = —gmgd re \, which does indeed give the 
in kinetic energy given in eq. (7.128). 
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19. The superball 

Since we are told that \v y \ is unchanged by the bounce, we can ignore it when applying 
conservation of energy. And since the vertical impulse from the floor provides no 
torque around the ball’s CM, we can completely ignore the y motion in this problem. 
The horizontal impulse from the floor is responsible for changing both v x and u>. With 
positive directions defined as in the statement of the problem, eq. (7.58) gives 


A L = RAp 

=> = Rm(v' x — v x ). 

But conservation of energy gives 

-mv ' 2 + -Iu ,2 = -mv 2 + -Ioj 2 

2 x ^2 2 

=► I(cj ,2 -u 2 ) = m{v 2 x -v' x ). 

Dividing this equation by eq. (7.133) gives 10 

R{ui' + u) = -{ v ' x + Vx ). 


(7.133) 


(7.134) 

(7.135) 


We can now combine this equation with eq. (7.133), which can be rewritten as, using 
I = (2/5) mi? 2 , 

?i?(u/ -u) =v' x -v x . (7.136) 


Given v x and u>, the previous two equations are two linear equations in the two 
unknowns, v' x and ui'. Solving for v' x and u)', and then writing the result in matrix 
notation, gives 



(7.137) 


as desired. As an exercise, you can use this result to show that the relative velocity 
of the ball’s contact point and the ground simply changes sign during the bounce. 


20. Many bounces 

Eq. (7.59) gives the result after one bounce, so the result after two bounces is 


3/7 - 4/7 )u: 


-10/7 -3/7 , 


' 3/7 -4/7 N 

-10/7 -3/7 , 
: 1 OUr/ 
0 1 ) V Ru , 


(7.138) 


The square of the matrix turns out to be the identity. Therefore, after two bounces, 
both v x and u> return to their original values. The ball then repeats the motion of 


10 We have divided out the trivial o/ = uj and v' x = v x solution, which corresponds to slipping 
motion on a frictionless plane. The nontrivial solution we will find shortly is the non-slipping one. 
Basically, to conserve energy, there must be no work done by friction. And since work is force times 
distance, this means that either the plane is frictionless, or that there is no relative motion between 
ball’s contact point and the plane. The latter case is the one we are concerned with here. 
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the previous two bounces (and so on, after every two bounces). The only difference 
between successive pairs of bounces is that the ball may shift horizontally. You are 
strongly encouraged to experimentally verify this interesting periodic behavior. 

21. Rolling over a bump 

We will use the fact that the angular momentum of the ball with respect to the corner 
of the step (call this point P) is unchanged by the collision. This is true because any 
forces exerted at point P provide zero torque around P. 11 This fact will allow us to 
find the energy of the ball right after the collision, which we will then require to be 
greater than mgh. 

Breaking the initial L into the contribution relative to the CM, plus the contribution 
from the ball treated like a point mass located at the CM, we see that the initial 
angular momentum is L = (2/5)mfl 2 wo + mVo(R — h), where uiq is the initial angular 
speed. But the non-slipping condition tells us that wo = Vo/P. Therefore, L may be 
written as 

L = ?mPVb + mV 0 (R - h) = mV 0 - hj . (7.139) 


Let a/ be the angular speed of the ball around point P immediately after the colli¬ 
sion. The parallel-axis theorem says that the moment of inertia around P is equal 
to (2/5)mP 2 + toP 2 = (7/5)mP 2 . Conservation of L (around point P) during the 
collision then gives 


(7 R 


(7.140) 


The energy of the ball right after the collision is therefore 

E=\( lmR i 


2 \ 5 




The ball will climb up over the step if E > mgh, which gives 

/lOflfe / 


. (7.141) 


(7.142) 


Remarks: Note that is possible for the ball to rise up over the step even if h > R, provided 
that the ball sticks to the corner, without slipping. (If h > R, the step would have to be 
“hollowed out” so that the ball doesn’t collide with the side of the step.) But note that 
Vo —> oo as h —> 7P/5. For h > 7P/5, it is impossible for the ball to make it up over the 
step, no matter how large Vo is. The ball will get pushed down into the ground, instead of 
rising up, if h > 7R/5. 

For an object with a general moment of inertia I = gmR 2 (so g = 2/5 in our problem), you 
can show that the minimum initial speed is 


Vo > 


/ 2 gh ( h V 

V 1 + V (! + rf)R) 


(7.143) 


This decreases as g increases. It is smallest when the “ball” is a wheel with all the mass on 
its rim (so that g = 1), in which case it is possible for the wheel to climb up over the step 
even if h is close to 2R. Jit 


11 The torque from gravity will be relevant once the ball rises up off the ground. But during the 
(instantaneous) collision, L will not change. 
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22. Lots of sticks 

Consider the collision between two sticks. Let V be the speed of the contact point on 
the heavy one. Since this stick is essentially infinitely heavy, we may consider it to 
be an infinitely heavy ball, moving at speed V. The rotational degree of freedom of 
the heavy stick is irrelevant, as far as the light stick is concerned. 

We may therefore invoke the result of Problem 17 to say that the relative speed of 
the contact points is the same before and after the collision. This implies that the 
contact point on the light stick picks up a speed of 2V, because the heavy stick is 
essentially unaffected by the collision and keeps moving at speed V. 

Let us now find the speed of the other end of the light stick. This stick receives an 
impulse from the heavy stick, so we can apply eq. (7.58) to the light stick to obtain 

rjmr 2 u> = r(mv CM ) => ru = . (7.144) 

The speed of the struck end is n str = ru + i> CM , because the rotational speed adds to 
the CM motion. The speed of the other end is t’ 0 th = ruj — v CM , because the rotational 
speed subtracts from the CM motion. 12 The ratio of these speeds is 


In the problem at hand, 


wth _ ^ir Vcm _ 1 - v 

Vstr ~ ^ + Vcm “ 1 + V ' 

have t’str = 2V. Therefore, 
V ('Slrifif 


(7.145) 


(7.146) 


The same analysis holds for all the other collisions. Therefore, the bottom ends of the 
sticks move with speeds that form a geometric progression with ratio 2(1 — r/)/(l + rf). 
If this ratio is less than 1 (that is, if r) > 1/3), then the speeds go to zero as n —* oo. 
If it is greater than 1 (that is, if 77 < 1/3), then the speeds go to infinity as n —> 00. 
If it equals 1 (that is, if rj = 1/3), then the speeds remain equal to V and are thus 
independent of n, as we wanted to show. A uniform stick has 77 = 1/3 relative to its 
center (which is usually written in the form I = mt 2 j 12, where £ = 2r). 


12 Since 77 < 1 for any real stick, we have ruj = Vcm/v > v C m ■ Therefore, rw — v C m 
or equal to zero. 


is greater than 
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Chapter 8 

Angular Momentum, Part II 
(General L) 

Copyright 2004 by David Morin, morin@physics.harvard.edu 


In the Chapter 7, we discussed situations where the direction of the vector L remains 
constant, and only its magnitude changes. In this chapter, we will look at the more 
complicated situations where the direction of L is allowed to change. The vector 
nature of L will prove to be vital, and we will arrive at all sorts of strange results 
for spinning tops and such things. 

This chapter is rather long, alas. The first three sections consist of general 
theory, and then in Section 8.4 we start solving some actual problems. 

8.1 Preliminaries concerning rotations 

8.1.1 The form of general motion 

Before getting started, we should make sure we’re all on the same page concerning 
a few important things about rotations. Because rotations generally involve three 
dimensions, they can often be hard to visualize. A rough drawing on a piece of 
paper might not do the trick. For this reason, this topic is one of the more difficult 
ones in this book. 

The next few pages consist of some definitions and helpful theorems. This first 
theorem describes the form of general motion. You might consider it obvious, but 
let’s prove it anyway. 

Theorem 8.1 Consider a rigid body undergoing arbitrary motion. Pick any point 
P in the body. Then at any instant (see Fig. 8.1), the motion of the body may be 
written as the sum of the translational motion of P, plus a rotation around some 
axis, uj, through P (the axis oj may change with time). 1 

Proof: The motion of the body may be written as the sum of the translational 
motion of P, plus some other motion relative to P (this is true because relative 

1 In other words, what we mean here is that a person at rest with respect to a frame whose origin 
is P, and whose axes are parallel to the fixed-frame axes, will see the body undergoing a rotation 
around some axis through P. 
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coordinates are additive quantities). We must show that this latter motion is simply 
a rotation. This seems quite plausible, and it holds because the body is rigid; that 
is, all points keep the same relative distances. (If the body weren’t rigid, then this 
theorem wouldn’t be true.) 

To be rigorous, consider a sphere fixed in the body, centered at P. The motion 
of the body is completely determined by the motion of the points on this sphere, so 
we need only examine what happens to the sphere. And because we are looking at 
motion relative to P, we have reduced the problem to the following: In what manner 
can a rigid sphere transform into itself? We claim that any such transformation 
requires that two points end up where they started. 2 

If this claim is true, then we are done, because for an infinitesimal transforma¬ 
tion, a given point moves in only one direction (since there is no time to do any 
bending). So a point that ends up where it started must have always been fixed. 
Therefore, the diameter joining the two fixed points remains stationary (because 
distances are preserved), and we are left with a rotation around this axis. 

This claim is quite believable, but nevertheless tricky to prove. I can’t resist 
making you think about it, so I’ve left it as a problem (Problem 1). Try to solve it 
on your own. ■ 

We will invoke this theorem repeatedly in this chapter (often without bothering 
to say so). Note that it is required that P be a point in the body, since we used the 
fact that P keeps the same distances from other points in the body. 

Remark: A situation where our theorem is not so obvious is the following. Consider 
an object rotating around a fixed axis, the stick shown in Fig. 8.2. In this case, u> simply 
points along the stick. But now imagine grabbing the stick and rotating it around some 
other axis (the dotted line shown). It is not immediately obvious that the resulting motion 
is (instantaneously) a rotation around some new axis through A. But indeed it is. (We’ll 
be quantitative about this in the “Rotating Sphere” example near the end of this section.) 
* 

8.1.2 The angular velocity vector 

It is extremely useful to introduce the angular velocity vector, u>, which is defined 
to point along the axis of rotation, with a magnitude equal to the angular speed. 
The choice of the two possible directions is given by the right-hand rule. (Curl your 
right-hand fingers in the direction of the spin, and your thumb will point in the 
direction of u>.) For example, a spinning record has u> perpendicular to the record, 
through its center (as shown in Fig. 8.3), with magnitude equal to the angular 
speed, u. 

Remark: You could, of course, break the mold and use the left-hand rule, as long 
as you use it consistently. The direction of cu would be opposite, but that doesn’t matter, 
because cJ isn’t really physical. Any physical result (for example, the velocity of a particle, 

2 This claim is actually true for any transformation of a rigid sphere into itself, but for the present 
purposes we are concerned only with infinitesimal transformations (because we are only looking at 
what happens at a given instant in time). 
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or the force on it) will come out the same, independent of which hand you (consistently) 
use. 


When studying vectors in school, 

You’ll use your right hand as a tool. 

But look in a mirror, 

And then you’ll see clearer, 

You can just use the left-handed rule. 4k 

The points on the axis of rotation are the ones that (instantaneously) do not 
move. Of, course, the direction of u> may change over time, so the points that were 
formerly on u> may now be moving. 

Remark: The fact that we can specify a rotation by specifying a vector w is a peculiar¬ 
ity to three dimensions. If we lived in one dimension, then there would be no such thing as 
a rotation. If we lived in two dimensions, then all rotations would take place in that plane, 
so we could label a rotation by simply giving its speed, oj. In three dimensions, rotations 
take place in ('^) = 3 independent planes. And we choose to label these, for convenience, 
by the directions orthogonal to these planes, and by the angular speed in each plane. If we 
lived in four dimensions, then rotations could take place in ()j) = 6 planes, so we would have 
to label a rotation by giving 6 planes and 6 angular speeds. Note that a vector (which has 
four components in four dimensions) would not do the trick here. 4k 

In addition to specifying the points that are instantaneously motionless, u> also 
easily produces the velocity of any point in the rotating object. Consider the case 
where the axis of rotation passes through the origin (which we will generally assume 
to be the case in this chapter, unless otherwise stated). Then we have the following 
theorem. 

Theorem 8.2 Given an object rotating with angular velocity u), the velocity of any 
point in the object is given by (with r being the position of the point) 

v = u> x~r~| . (8.1) 

Proof: Drop a perpendicular from the point in question (call it P) to the axis u> 
(call the point there Q ). Let r' be the vector from Q to P (see Fig. 8.4). From the 
properties of the cross product, v = w x r is orthogonal to u>. r, and also r' (since 
r' is a linear combination of u and r). Therefore, the direction of v is correct (it 
lies in a plane perpendicular to u>, and is also perpendicular to r', so it describes 
circular motion around the axis u>; also, by the right-hand rule, it points in the 
proper orientation around w). And since 

|v| — M|r| sin 0 — ur 1 . (8.2) 

which is the speed of the circular motion around u>, we see that v has the correct 
magnitude. So v is indeed the correct velocity vector. ■ 

Note that if we have the special case where P lies along u>, then r is parallel to 
u), and so the cross product gives a zero result for v, as it should. 


(0 



Figure 8.4 
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Eq. (8.1) is extremely useful and will be applied repeatedly in this chapter. Even 
if it’s hard to visualize what’s going on with a given rotation, all you have to do to 
find the speed of any given point is calculate the cross product m x r. 

Conversely, if the speed of every point in a moving body is given by v = m x r, 
then the body is undergoing a rotation with angular velocity m (because all points 
on the axis m are motionless, and all other points move with the proper speed for 
this rotation). 

A very nice thing about angular velocities is that they simply add. Stated more 
precisely, we have the following theorem. 

Theorem 8.3 Let coordinate systems Si, S2, and S3 have the same origin. Let 
Si rotate with angular velocity with respect to SA Let S2 rotate with angular 
velocity m2..3 with respect to S3. Then Si rotates (instantaneously) with angular 
velocity 

Wl,3 = ^1,2 + <*>2,3 (8.3) 

with respect to S3. 

Proof: If mi,2 and m2.3 point in the same direction, then the theorem is clear; 
the angular speeds just add. If, however, they don’t point in the same direction, 
then things are a little harder to visualize. But we can prove the theorem by simply 
making abundant use of the definition of m. 

Pick a point Pi at rest in Si. Let r be the vector from the origin to Pi. The 
velocity of Pi (relative to a very close point P2 at rest in S2) due to the rotation 
about m^2 is Vp, p 2 = mi, 2 x r. The velocity of P2 (relative to a very close point 
P3 at rest in S3) due to the rotation about m2,3 is V p 2 p 3 = m2,3 x r (because P2 is 
also located essentially at position r). Therefore, the velocity of Pi (relative to P3) 
is Vpjp 2 + V p 2 p 3 = (mi,2 + m2,3) x r. This holds for any point Pi at rest in Sj. So 
the frame Si rotates with angular velocity (mi,2 + m2,3) with respect to S3. ■ 

Note that if mi,2 is constant in S2, then the vector mi,3 = mi,2 + m2,3 will change 
with respect to S3 as time goes by (because mi,2, which is fixed in S2, is changing 
with respect to S3). But at any instant, mi, 3 may be obtained by simply adding the 
present values of mi,2 and m2,3. Consider the following example. 



Example (Rotating sphere): A sphere rotates with angular speed m3 around a 
stick that initially points in the z direction. You grab the stick and rotate it around 
the y-axis with angular speed u> 2 - What is the angular velocity of the sphere, with 
respect to the lab frame, as time goes by? 

Solution: In the language of Theorem 8.3, the sphere defines the Si frame; the stick 
and the y-axis define the S2 frame; and the lab frame is the S3 frame. The instant 
after you grab the stick, we are given that mi,2 = mgz, and m2,3 = m-iy. Therefore, 
the angular velocity of the sphere with respect to the lab frame is mi,3 = mi,2+m 2 ,3 = 
UJ 3 Z + u> 2 y. This is shown is Fig. 8.5. As time goes by, the stick (and hence mi,2) 
rotates around the y axis, so mi,3 = mi,2 + m2,3 traces out a cone around the y axis, 
as shown. 


Figure 8.5 
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Remark: Note the different behavior of 0)1,3 for a slightly different statement of the problem: 
Let the sphere initially rotate with angular velocity w 2 y. Grab the axis (which points in 
the y direction) and rotate it with angular velocity W3Z. For this situation, 0)1,3 initially 
points in the same direction as in the above statement of the problem (it is initially equal to 
o>3Z + w 2 y), but as time goes by, it is the o> 2 y vector that will change, so 0)1,3 = 0)1,2 + 0)2,3 
traces out a cone around the z axis, as shown in Fig. 8.6. ft 


An important point concerning rotations in that they are defined with respect 
to a coordinate system. It makes no sense to ask how fast an object is rotating 
with respect to a certain point, or even a certain axis. Consider, for example, an 
object rotating with angular velocity u> = W 3 Z, with respect to the lab frame. Saying 
only, “The object has angular velocity u> = u>->z,” is not sufficient, because someone 
standing in the frame of the object would measure u) = 0 , and would therefore be 
very confused by your statement. 

Throughout this chapter, we’ll try to remember to state the coordinate system 
with respect to which u) is measured. But if we forget, the default frame is the lab 
frame. 

If you want to strain some brain cells thinking about a; vectors, you are encour¬ 
aged to solve Problem 3, and then also to look at the three given solutions. 

This section was a bit abstract, so don’t worry too much about it at the moment. 
The best strategy is probably to read on, and then come back for a second pass after 
digesting a few more sections. At any rate, we’ll be discussing many other aspects 
of u> in Section 8.7.2. 



Figure 8.6 


8.2 The inertia tensor 


Given an object undergoing general motion, the inertia tensor is what relates the 
angular momentum, L, to the angular velocity, u>. This tensor 3 depends on the 
geometry of the object, as we will see. In finding the L due to general motion, we 
will (in the same spirit as in Section 7.1) first look at the special case of rotation 
around an axis through the origin. Then we will look at the most general possible 
motion. 

8.2.1 Rotation about an axis through the origin 

The three-dimensional object in Fig. 8.7 rotates with angular velocity u>. Consider 
a little piece of the body, with mass dm and position r. The velocity of this piece is 
v = oj x r. So the angular momentum (relative to the origin) of this piece is equal 
to r x p = (dm) r x v = (dm) r x (u> x r). The angular momentum of the entire 
body is therefore 

L = J r x (u> x r) dm, (8.4) 

where the integration runs over the volume of the body. 



1 “Tensor” is just a fancy name for “matrix” here. 
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In the case where the rigid body is made up of a collection of point masses, rrii , 
the angular momentum is simply 

L = x (oj x rj). (8.5) 

This double cross-product looks a bit intimidating, but it’s actually not so bad. 
First, we have 


x y z 



= (iv 2 z - uj 3 y)x.+ (ojzx - w\z)y + (u\y - uj 2 x)z,. (8.6) 

Therefore, 

x y z 

rx(wxr)= x y z 

(u 2 z - u 3 y) ( uj 3 x - mi z) ( uiiy - u 2 x) 

= (u^y 2 + z 2 ) - u 2 xy - u 3 zx^-k 


+ (oj 2 (z 2 + x 2 ) - U) 3 yz - Uixy^jy 

+ (uj 3 (x 2 + y 2 )-uizx-u 2 yz^z. (8.7) 

The angular momentum in eq. (8.4) may therefore be written in the nice, concise, 
matrix form, 



For sake of clarity, we have not bothered to write the dm part of each integral. The 
matrix I is called the inertia tensor. If the word “tensor” scares you, just ignore it. 
I is simply a matrix. It acts on one vector (the angular velocity) to yield another 
vector (the angular momentum). 

Remarks: 

1. I is a rather formidable-looking object. Therefore, you will undoubtedly be very 
pleased to hear that you will rarely have to use it. It’s nice to know that it’s there 
if you do need it, but the concept of principal axes in Section 8.3 provides a much 
better way of solving problems, which avoids the use of the inertia tensor. 

2. I is a symmetric matrix. (This fact will be important in Section 8.3.) There are 
therefore only six independent entries, instead of nine. 
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3. In the case where the rigid body is made up of a collection of point masses, m*, the 

entries in the matrix are just sums. For example, the upper left entry is m i (vf Azf). 

4. I depends only on the geometry of the object, and not on uj. 

5. To construct an I, you not only need to specify the origin, you also need to specify 
the x,y,z axes of your coordinate system. (These basis vectors must be orthogonal, 
because the cross-product calculation above is valid only for an orthonormal basis.) 
If someone else comes along and chooses a different orthonormal basis (but the same 
origin), then her I will have different entries , as will her uj, as will her L. But her 
uj and L will be exactly the same vectors as your uj and L. They will only appear 
different because they are written in a different coordinate system. (A vector is what 
it is, independent of how you choose to look at it. If you each point your arm in the 
direction of what you calculate L to be, then you will both be pointing in the same 
direction.) £ 

All this is fine and dandy. Given any rigid body, we can calculate I (relative to 
a given origin, using a given set of axes). And given u>, we can then apply I to it 
to find L (relative to the origin). But what do these entries in I really mean? How 
do we interpret them? Note, for example, that the L3 in eq. (8.8) contains terms 
involving uji and u 2. But uj\ and u 2 have to do with rotations around the x and y 
axes, so what in the world are they doing in L3? Consider the following examples. 


Example 1 (Point-mass in x-y plane): Consider a point-mass m traveling in a 
circle (centered at the origin) in the x-y plane, with frequency w 3 . Let the radius of 
the circle be r (see Fig. 8.8). 

Using uj = (0,0,W3), x 2 + y 2 = r 2 , and z = 0 in eq. (8.8) (with a discrete sum of 
only one object, instead of the integrals), the angular momentum with respect to the 
origin is 

L = (0,0,mr 2 w 3 ). (8.9) 

The 2-component is rrirv. as it should be. And the x- and y-components are 0, as 
they should be. This case where uq = 002 = 0 and z = 0 is simply the case we studied 
in the Chapter 7. 


Example 2 (Point-mass in space): Consider a point-mass m traveling in a circle 
of radius r, with frequency u> 3. But now let the circle be centered at the point (0,0, zq), 
with the plane of the circle parallel to the x-y plane (see Fig. 8.9). 

Using uj = (0,0, w 3 ), x 2 + y 2 = r 2 , and 2 = zq in eq. (8.8), the angular momentum 
with respect to the origin is 


L = mw 3 (-xzo,-yzo,r 2 ). (8.10) 

The ^-component is mrv, as it should be. But, surprisingly, we have nonzero L\ and 
1/2, even though our mass is simply rotating around the 2-axis. What’s going on? 
Consider the instant when the mass is in the x-z plane. The velocity of the mass is 
then in the y direction. Therefore, the particle most certainly has angular momentum 
around the rr-axis, as well as the 2-axis. (Someone looking at a split-second movie 
of the particle at this point could not tell whether the mass was rotating around the 



Figure 8.8 
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Figure 8.9 
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Figure 8.10 


x-axis, the 2-axis, or undergoing some complicated motion. But the past and future 
motion is irrelevant; at any instant in time, as far as the angular momentum goes, we 
are concerned only with what is happening at that instant.) 

At this instant, the angular momentum around the x-axis is —rnz^v (since Zq is the 
distance from the x-axis; and the minus sign comes from the right-hand rule). Using 
v = 0J3X, we have L\ = —mxzoojs, in agreement with eq. (8.10). 

At this instant, L 2 is zero, since the velocity is parallel to the y-axis. This agrees with 
eq. (8.10), since y = 0. And you can check that eq. (8.10) is indeed correct when the 
mass is at a general point (x,y,zo). 

For a point mass, L is much more easily obtained by simply calculating L = rxp 
(you should use this to check the results of this example). But for more complicated 
objects, the tensor I must be used. 


Example 3 (Two point-masses): Add another point-mass m to the previous 
example. Let it travel in the same circle, at the diametrically opposite point (see 
Fig. 8.10). 

Using u = (0,0, w 3 ), x 2 + y 2 = r 2 , and 2 = 20 in eq. (8.8), you can show that the 
angular momentum with respect to the origin is 

L = 2mw 3 (0,0,r 2 ). (8.11) 

The 2-component is 2 mrv, as it should be. And L\ and L 2 are zero, unlike in the 
previous example, because these components of the L’s of the two particles cancel. 
This occurs because of the symmetry of the masses around the 2-axis, which causes 
the I zx and I zy entries in the inertia tensor to vanish (because they are each the sum 
of two terms, with opposite x values, or opposite y values). 


Let’s now look at the kinetic energy of our object (which is rotating about an 
axis passing through the origin). To find this, we need to add up the kinetic energies 
of all the little pieces. A little piece has energy (dm) v 2 /2 = dm \u> x r| 2 /2. So, using 
eq. (8.6), the total kinetic energy is 


T = \ J - w 3 y) 2 + (^x - W12) 2 + (ui y - u 2 x ) 2 


( 8 . 12 ) 


Multiplying this out, we see (after a little work) that we may write T as 


( Kv 2 + z 2 ) 

-fxy 

-fzx \ 

/ut\ 

-fxy 

f(z 2 + x 2 ) 

-fyz 

uj 2 

V -1 zx 

-fyz 

f(x 2 + y 2 ) y 

\ W 3/ 


(8.13) 


If u = cu 3 z, then this reduces to the T = /33CU3 /2 result in eq. (7.8) in Chapter 7 
(with a slight change in notation). 
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8.2.2 General motion 

How do we deal with general motion in space? For the motion in Fig. 8.11, the 
various pieces of mass are not traveling in circles about the origin, so we cannot 
write v = m x r, as we did prior to eq. (8.4). 

To determine L (relative to the origin), and also the kinetic energy T, we will 
invoke Theorem 8.1. In applying this theorem, we may choose any point in the body 
to be the point P in the theorem. However, only in the case that P is the object’s 
CM can we extract anything useful. The theorem then says that the motion of the 
body is the sum of the motion of the CM plus a rotation about the CM. So, let the 
CM move with velocity V, and let the body instantaneously rotate with angular 
velocity u/ around the CM. (That is, with respect to the frame whose origin is the 
CM, and whose axes are parallel to the fixed-frame axes.) 

Let the CM coordinates be R = (X,Y,Z), and let the coordinates relative to 
the CM be r' = ( x ', y', z'). Then r = R+r' (see Fig. 8.12). Let the velocity relative 
to the CM be v' (so v' = u/ x r'). Then v = V + v'. 

Let’s look at L first. The angular momentum is 

L = J r x v dm 

= f (R + r') x (V + (u/ x r')) dm 

= J (R x V) dm + J r' x (u/ x r') dm 
= M(RxV) + L cm . (8.14) 

The cross terms vanish because the integrands are linear in r 1 (and so the inte¬ 
grals, which involve /r 'dm, are zero by definition of the CM). Lcm is the angular 
momentum relative to the CM. 4 

As in the pancake case Section 7.1.2, we see that the angular momentum (relative 
to the origin) of a body can be found by treating the body as a point mass located 
at the CM and finding the angular momentum of this point mass (relative to the 
origin), and by then adding on the angular momentum of the body, relative to the 
CM. Note that these two parts of the angular momentum need not point in the 
same direction (as they did in the pancake case). 

Now let’s look at T. The kinetic energy is 

T = 

= /I|V + v'| 2 dm 

= J^V 2 dm + J^v' 2 dm 

= ^MV 2 + J ^\u>' x r'\ 2 dm 

4 By this, we mean the angular momentum as measured in the coordinate system whose origin 
is the CM, and whose axes are parallel to the fixed-frame axes. 



Figure 8.11 
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= l -MV 2 + l -J • L C m, (8.15) 

where the last line follows from the steps leading to eq. (8.13). The cross term 
fV-v' dm = f V • (a / x r') dm vanishes because the integrand is linear in r' (and 
thus yields a zero integral, by definition of the CM). 

As in the pancake case Section 7.1.2, we see that the kinetic energy of a body 
can be found by treating the body as a point mass located at the CM, and by then 
adding on the kinetic energy of the body due to motion relative to the CM. 


8.2.3 The parallel-axis theorem 

Consider the special case where the CM rotates around the origin with the same 
angular velocity at which the body rotates around the CM (see Fig. 8.13). That is, 
V = u)' x R, (This may be achieved, for example, by having a rod stick out of the 
body and pivoting one end of the rod at the origin.) This means that we have the 
nice situation where all points in the body travel in fixed circles around the axis of 
rotation (because v = V + v' = u/ x R + u/ x r' = u/ x r). Dropping the prime on 
to , eq. (8.14) becomes 

L = MR x (w x R) + j r' x (u x r') dm 

Expanding the double cross-products as in the steps leading 
may write this as 

/ Li \ / Y 2 + Z 2 —XY -ZX \ /wi 

L 2 = M -XY Z 2 + X 2 -YZ u 2 

\L 3 J \ -ZX -YZ X 2 + Y 2 ) \ w 3 

(Ry' 2 + z' 2 ) -fx'y' -fz'x' ' 

+ -fx'y' f(z’ 2 + x' 2 ) — f y'z' 

V -Jz'x' - f y'z' J(x ,2 + y' 2 ) j 

= (Ir + Icm)w. 

This is the generalized parallel-axis theorem. It says that once you’ve calculated 
Icm for an axis through the CM, then if you want to calculate I around any parallel 
axis, you simply have to add on the Ir matrix (obtained by treating the object like 
a point-mass at the CM). So you have to compute six numbers (there are only six, 
instead of nine, because the matrix is symmetric) instead of just the one MR 2 in 
the parallel-axis theorem in Chapter 7, given in eq. (7.12). 

Likewise, if V = u/ x R, then eq. (8.15) gives (dropping the prime on u>) a 
kinetic energy of 

T = -u ■ (Ir + Icm)w = -u) ■ L. 


(8.16) 
to eq. (8.8), we 



(8.17) 


( 8 . 18 ) 
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8.3 Principal axes 

The cumbersome expressions in the previous section may seem a bit unsettling, but 
it turns out that you will rarely have to invoke them. The strategy for avoiding all 
the previous mess is to use the principal axes of a body, which we will define below. 

In general, the inertia tensor I in eq. ( 8 . 8 ) has nine nonzero entries (six inde¬ 
pendent ones). In addition to depending on the origin chosen, this inertia tensor 
depends on the set of orthonormal basis vectors chosen for the coordinate system. 
(The x,y,z variables in the integrals in I depend on the coordinate system with 
respect to which they are measured, of course.) 

Given a blob of material, and given an arbitrary origin , 5 any orthonormal set of 
basis vectors is usable, but there is one special set that makes all our calculations 
very nice. These special basis vectors are called the principal axes. They can be 
defined in various equivalent ways. 

• The principal axes are the orthonormal basis vectors for which I is diagonal, 
that is, for which 6 

( h 0 0 \ 

I = 0 h o . (8.19) 

V 0 0 h ) 

T\ . I 2 , and Is are called the principal moments. 

For many objects, it is quite obvious what the principal axes are. For example, 
consider a uniform rectangle in the x-y plane, and let the CM be the origin (and 
let the sides be parallel to the coordinate axes). Then the principal axes are 
clearly the x, y, and z axes, because all the off-diagonal elements of the inertia 
tensor in eq. ( 8 . 8 ) vanish, by symmetry. For example I xy = — f xydm equals 
zero, because for every point (x, y) in the rectangle, there is a corresponding 
point (—x,y). So the contributions to / xydm cancel in pairs. Also, the 
integrals involving z are identically zero, because z = 0 . 

• The principal axes are the orthonormal set wj, u> 2 , CO 3 with the property that 

Io>i = Iiwi, Iu>2 = -I2W2, = /3CJ3. (8.20) 

(That is, they are the uj’s for which L points in the same direction as u>.) 
These three statements are equivalent to eq. (8.19), because the vectors u>i, 
a> 2 , and u >3 are simply ( 1 , 0 , 0 ), ( 0 , 1 , 0 ), and ( 0 , 0 , 1 ) in the frame in which 
they are the basis vectors. 

• The principal axes are the axes around which the object can rotate with con¬ 
stant speed, without the need for any torque. (So in some sense, the object is 

the CM is often chosen to be the origin, but it need not be. There are principal axes associated 
with any origin. 

technically, we should be writing In instead of 7i, etc., in this matrix, because we’re talking 
about elements of a matrix. (The one-index object h looks like a component of a vector.) But the 
two-index notation gets cumbersome, so we’ll be sloppy and just use h, etc. 
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“happy” to spin around a principal axis.) This is equivalent to the previous 
definition for the following reason. Assume the object rotates around an axis 
d>i, for which L = Iu>\ = I\U\, as in eq. (8.20). Then, since u >i is assumed to 
be fixed, we see that L is also fixed. Therefore, r = dL/dt = 0. 

The lack of need for any torque, for rotation around a principal axis to, means 
that if the object is pivoted at the origin, and if the origin is the only place where 
any force is applied, then the object can undergo rotation with constant angular 
velocity u. If you try to set up this scenario with a non-principal axis, it won’t 
work. 


Example (Square with origin at corner): Consider the uniform square in 

Fig. 8.14. In Appendix G, we show that the principal axes are the dotted lines 
shown (and also the 2 -axis perpendicular to the page). But there is no need to use 
the techniques of the appendix to see this, because in this basis it is clear that the 
integral f X\X^ is zero, by symmetry. (And £3 = 2 is identically zero, which makes 
the other off-diagonal terms in I also equal to zero.) 

Furthermore, it is intuitively clear that the square will be happy to rotate around 
any one of these axes indefinitely. During such a rotation, the pivot will certainly be 
supplying a force (if the axis is uq or z), to provide the centripetal acceleration for 
the circular motion of the CM. But it will not be applying a torque relative to the 
origin (because the r in r x F is 0). This is good, because for a rotation around one 
of these principal axes, dL/dt = 0, and there is no need for any torque. 

It is fairly clear that it is impossible to make the square rotate around, say, the x-axis, 
assuming that its only contact with the world is through a free pivot at the origin. 
The square simply doesn’t want to remain in that circular motion. There are various 
ways to demonstrate this rigorously. One is to show that L (relative to the origin) 
will not point along the x-axis, so it will therefore precess around the x-axis along 
with the square, tracing out the surface of a cone. This means that L is changing. 
But there is no torque available (relative to the origin) to provide for this change in 
L. Hence, such a rotation cannot exist. 

Note also that the integral f xy is not equal to zero (every point gives a positive 
contribution). So the inertia tensor is not diagonal in the x-y basis, which means that 
x and y are not principal axes. 


At the moment, it is not at all obvious that an orthonormal set of principal 
axes exists for an arbitrary object. This is the task of Theorem 8.4 below. But 
assuming that principal axes do exist, the L and T in eqs. ( 8 . 8 ) and (8.13) take on 
the particularly nice forms, 

L = (J 1 W 1 , J 2 W2,/3W3), 

T = -(iicui + h^2 + hv\)- (8.21) 

in the basis of the principal axes. (The numbers uq, u> 2 , and uq are the components 
of a general vector u> written in the principal-axis basis; that is, u = oquq + CO 2&2 + 
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This is a vast simplification over the general formulas in eqs. (8.8) and 
(8.13). We will therefore invariably work with principal axes in the remainder of 
this chapter. 

Remark: Note that the directions of the principal axes (relative to the body) depend 
only on the geometry of the body. They may therefore be considered to be painted onto the 
object. Hence, they will generally move around in space as the body rotates. (For example, 
in the special case where the object is rotating happily around a principal axis, then that 
axis will stay fixed, and the other two principal axes will rotate around it in space.) In 
equations like u) = (uq , u> 2 , cog) and L = (7iuq , 1 2 ^ 2 ,I 3 W 3 ), the components u>i and are 
measured along the instantaneous principal axes uq. Since these axes change with time, the 
components u>i and Ru>i will generally change with time (except in the case where we have 
a nice rotation around a principal axis). & 

Let us now prove that a set of principal axes does indeed exist, for any object, 
and any origin. Actually, we’ll just state the theorem here. The proof involves a 
rather slick and useful technique, but it’s slightly off the main line of thought, so 
we’ll relegate it to Appendix F. Take a look at the proof if you wish, but if you want 
to simply accept the fact that the principal axes exist, that’s fine. 

Theorem 8.4 Given a real symmetric 3x3 matrix, I, there exist three orthonormal 
real vectors, u>k, and three real numbers, Ik, with the property that 


Lh fc = Ifcchfc. 


( 8 . 22 ) 


Proof: See Appendix F. ■ 

Since the inertia tensor in eq. (8.8) is indeed symmetric, for any body and any 
origin, this theorem says that we can always find three orthogonal basis-vectors for 
which I is a diagonal matrix. That is, principal axes always exist. Invariably, it 
is best to work in a coordinate system that has this basis. (As mentioned above, 
the CM is generally chosen to be the origin, but this is not necessary. There are 
principal axes associated with any origin.) 

Problem 5 gives another way to show the existence of principal axes in the special 
case of a pancake object. 

For an object with a fair amount of symmetry, the principal axes are usually the 
obvious choices and can be written down by simply looking at the object (examples 
are given below). If, however, you are given an unsymmetrical body, then the only 
way to determine the principal axes is to pick an arbitrary basis, then find I in this 
basis, then go through a diagonalization procedure. This diagonalization procedure 
basically consists of the steps at the beginning of the proof of Theorem 8.4 (given in 
Appendix F), with the addition of one more step to get the actual vectors, so we’ll 
relegate it to Appendix G. You need not worry much about this method. Virtually 
every problem we encounter will involve an object with sufficient symmetry to enable 
you to simply write down the principal axes. 

Let’s now prove two very useful (and very similar) theorems, and then we’ll give 
some examples. 
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Theorem 8.5 If two principal moments are equal (I\ = 1-2 = 1), then any axis 
(through the chosen origin) in the plane of the corresponding principal axes is a 
principal axis (and its moment is also I). 

Similarly, if all three principal moments are equal (Ji = L 2 = I 3 = I), then any 
axis (through the chosen origin) in space is a principal axis (and its moment is also 

I)- 

Proof: This first part was already proved at the end of the proof in Appendix F, 
but we’ll do it again here. Let I\ = I 2 = I■ Then Iui = Jui, and IU2 = /u 2 . Hence, 
I(aui + 6112) = /(aui + 6112). Therefore, any linear combination of Ui and 112 is a 
solution to Iu = Ju and is thus a principal axis, by definition. 

Similarly, let I\ = h = I3 = I • Then Iui = Jui, I112 = I112, and IU3 = /113. 
Hence, I(aui + &U2 + CU3) = /(aui + bu 2 + CU3). Therefore, any linear combination 
of ui, U2, and 113 (that is, any vector in space) is a solution to Iu = Iu and is thus 
a principal axis, by definition. 

Basically, if I\ = I 2 = I , then I is (up to a multiple) the identity matrix in the 
space spanned by U\ and u 2 . And if Ii = I 2 = I3 = I, then I is (up to a multiple) 
the identity matrix in the entire space. ■ 

If two or three moments are equal, so that there is freedom in choosing the 
principal axes, then it is possible to pick a non-orthogonal group of them. We will, 
however, always choose ones that are orthogonal. So when we say “a set of principal 
axes”, we mean an orthonormal set. 

Theorem 8.6 If a pancake object is symmetric under a rotation through an angle 
6 180° in the x-y plane (for example, a hexagon), then every axis in the x-y plane 

(with the origin chosen to be the center of the symmetry rotation) is a principal axis. 

Proof: Let u>o be a principal axis in the plane, and let ug be the axis obtained 

by rotating u 0 through the angle 9 . Then ug is also a principal axis with the same 
principal moment (due to the symmetry of the object). Therefore, Iuq = Iu 0, and 

lu e = I&e- 

Now, any vector u in the x-y plane can be written as a linear combination of 
a>o and < 1 )q, provided that 9 7^ 180° (this is where we use that assumption). That 
is, u>o and ug span the x-y plane. Therefore, any vector ui may be written as 
u = au>o + bug, and so 

I u = I(aw 0 + bug) = aluo + blug = Iu. (8.23) 

Hence, u is also a principal axis. (Problem 6 gives another proof of this theorem.) 


Let’s now give some examples. We’ll state the principal axes for the following 
objects (relative to the origin). Your exercise is to show that these are correct. 
Usually, a quick symmetry argument shows that 


1 = 


I(y 2 + z 2 ) 
-fxy 
-fzx 


-fxy 


-fyz 


-fzx 
-fyz 
f(x 2 + y 2 ) 


(8.24) 
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is diagonal. In all of these examples (see Fig. 8.15), the origin for the principal axes 
is the origin of the given coordinate system (which is not necessarily the CM). In 
describing the axes, they thus all pass through the origin, in addition to having the 
other properties stated. 


Example 1: Point mass at the origin. 
principal axes: any axes. 

Example 2: Point mass at the point (a;o, yo, zo). 
principal axes: axis through point, any axes perpendicular to this. 
Example 3: Rectangle centered at the origin, as shown. 
principal axes: 2 -axis, axes parallel to sides. 

Example 4: Cylinder with axis as 2 -axis. 
principal axes: 2 -axis, any axes in x-y plane. 

Example 5: Square with one corner at origin, as shown. 
principal axes: z axis, axis through CM, axis perp to this. 



8.4 Two basic types of problems 

The previous three sections introduced many new, and somewhat abstract, concepts. 
We will now (finally) get our hands dirty and solve some actual problems. The 
concept of principal axes, in particular, gives us the ability to solve many kinds of 
problems. Two types, however, come up again and again. There are variations on 
these, of course, but they may be generally stated as follows. 


_L 

Figure 8.15 


• Strike a rigid object with an impulsive (that is, quick) blow. What is the 
motion of the object immediately after the blow? 

• An object rotates around a fixed axis. A given torque is applied. What is the 
frequency of the rotation? (Or conversely, given the frequency, what is the 
required torque?) 


Let’s work through an example for each of these problems. In both cases, the 
solution involves a few standard steps, so we’ll write them out explicitly. 

8.4.1 Motion after an impulsive blow 

Problem: Consider the rigid object in Fig. 8.16. Three masses are connected 
by three massless rods, in the shape of an isosceles right triangle with hypotenuse 
length 4a. The mass at the right angle is 2m, and the other two masses are m. 
Label them A, B, C, as shown. Assume that the object is floating freely in space. 
(Alternatively, let the object hang from a long thread attached to mass C.) 



Figure 8.16 





m 2a 2 a m 

Figure 8.17 
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Mass B is struck with a quick blow, directed into the page. Let the imparted 
impulse have magnitude J F dt = P. (See Section 7.6 for a discussion of impulse 
and angular impulse.) What are the velocities of the three masses immediately after 
the blow? 

Solution: The strategy of the solution will be to find the angular momentum of the 
system (relative to the CM) using the angular impulse, then calculate the principal 
moments and find the angular velocity vector (which will give the velocities relative 
to the CM), and then add on the CM motion. 

The altitude from the right angle to the hypotenuse has length 2a, and the CM 
is easily seen to be located at its midpoint (see Fig. 8.17). Picking the CM as our 
origin, and letting the plane of the paper be the x-y plane, the positions of the three 
masses are = (—2a, — a,0), r# = (2a,—a,0), and = (0,a,0). There are now 
five standard steps that we must perform. 

• Find L: The positive 2 -axis is directed out of the page, so the impulse 

vector is P = f F dt = (0,0,— P). Therefore, the angular momentum of the 
system (relative to the CM) is 

L = J r dt = J (r_s x F) dt = yb X J F dt 

= (2a, -a, 0) X (0,0, - P ) = aP(l, 2,0), (8.25) 

as shown in Fig. 8.17. We have used the fact that yb is essentially constant 
during the blow (because the blow is assumed to happen very quickly) in 
taking yb outside the integral in the above equation. 

• Calculate the principal moments: The principal axes are clearly the x, 
y, and 2 axes. The moments (relative to the CM) are 

I x = ma 2 + ma 2 + ( 2 m)a 2 = Ama 2 , 

I y = m( 2 a ) 2 + m( 2 a ) 2 + ( 2 m) 0 2 = 8 ma 2 , 

I z = I x + I y = 12 ma 2 . (8.26) 

We have used the perpendicular-axis theorem, eq. (7.17), to obtain I z . But 
I z will not be needed to solve the problem. 

• Find u: We now have two expressions for the angular momentum of the 

system. One expression is in terms of the given impulse, eq. (8.25). The other 
is in terms of the moments and the angular velocity components, eq. ( 8 . 21 ). 
Therefore, 


(Ama 2 u> x , 


{I x u x ,I y u y ,I z u> z ) 
8 ma 2 u> y , \2ma 2 u z ) 


as shown in Fig. 8.17. 


aP( 1,2,0) 
oP(l,2,0) 

-h-a.l.O), (8.27) 
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• Calculate speeds relative to CM: Right after the blow, the object rotates 
around the CM with the angular velocity found above. The speeds relative to 
the CM are therefore Uj = u x iq. That is, 

p 

ua = uj x ya = -(1,1,0) x (—2a, — a, 0) = (0,0, P/4m), 

4 ma 
P 

us = uj x yb = -(1,1,0) x (2a, —a,0) = (0, 0, —3P/4m), 

4 ma 

u c = u x r c = — (1,1,0) x (0,a,0) = (0,0, P/4m). (8.28) 

4 ma 

• Add on speed of CM: The impulse (that is, the change in linear momen¬ 
tum) supplied to the whole system is P = (0, 0, —P). The total mass of the 
system is M = 4m. Therefore, the velocity of the CM is 

VcM = Jl = (0, °’ - p / 4m )' ( 8 - 29 ) 

The total velocities of the masses are therefore 

VA = UA + Vcm = (0,0,0), 

v s = Us + Vcm = (0,0, -P/m), 

vc = uc + Vcm = (0,0,0). (8.30) 


Remarks: 

1. We see that masses A and C are instantaneously at rest immediately after the blow, 
and mass B acquires all of the imparted impulse. In retrospect, this is quite clear. 
Basically, it is possible for both A and C to remain at rest while B moves a tiny bit, 
so this is what happens. (If B moves into the page by a small distance e, then A 
and C won’t know that B has moved, since their distances to B will change only by 
a distance of order e 2 .) If we changed the problem and added a mass D at, say, the 
midpoint of the hypotenuse, then this would not be the case; it would not be possible 
for A, C, and D to remain at rest while B moved a tiny bit. So there would be some 
other motion, in addition to B’s. 

2 . As time goes on, the system will undergo a rather complicated motion. What will 
happen is that the CM will move with constant velocity, and the masses will rotate 
around it in a messy (but understandable) manner. Since there are no torques acting 
on the system (after the initial blow), we know that L will forever remain constant. It 
turns out that u> will move around L, and the body will rotate around this changing 
u>. These matters are the subject of Section 8.6. (Although in that discussion, we 
restrict ourselves to symmetric tops; that is, ones with two equal moments.) But these 
issues aside, it’s good to know that we can, without too much difficulty, determine 
what’s going on immediately after the blow. 

3. The body in the above problem was assumed to be floating freely in space. If we 
instead have an object that is pivoted at a given (fixed) point, then we simply want 
to use the pivot as our origin, and there is no need to perform the last step of adding 
on the velocity of the origin (which was the CM, above), since this velocity is now 
zero. Equivalently, just consider the pivot to be an infinite mass, which is therefore 
the location of the (motionless) CM. 4k 
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Figure 8.18 



Figure 8.19 


8.4.2 Frequency of motion due to a torque 

Problem: Consider a stick of length £, mass m, and uniform mass density. The 
stick is pivoted at its top end and swings around the vertical axis. Assume conditions 
have been set up so that the stick always makes an angle 9 with the vertical, as shown 
in Fig. 8.18. What is the frequency, u, of this motion? 

Solution: The strategy of the solution will be to find the principal moments and 
then the angular momentum of the system (in terms of u), then find the rate of 
change of L, and then calculate the torque and equate it with dL/dt. We will 
choose the pivot to be the origin . 7 Again, there are five standard steps that we 
must perform. 

• Calculate the principal moments: The principal axes are clearly the 

axis along the stick, along with any two orthogonal axes perpendicular to the 
stick. So let the x- and y- axes be as shown in Fig. 8.19, and let the z-axis 
point out of the page. The moments (relative to the pivot) are I x = m£ 2 /3, 
I y = 0, and I z = m£ 2 /3. (I z won’t be needed in this solution.) 

• Find L: The angular velocity vector points vertically , 8 so in the basis of the 
principal axes, the angular velocity vector is u = (u sin 6 , u cos 9A)), where u 
is yet to be determined. The angular momentum of the system (relative to 
the pivot) is thus 

L = {I x u x ,lyu y ,l z u z ) = (m£ 2 uj sin 0/3,0,0). (8.31) 


• Find dL/dt: The vector L therefore points upwards to the right, along the 
x-axis (at the instant shown in Fig. 8.19), with magnitude L = m£ 2 uo sin0/3. 
As the stick rotates around the vertical axis, L traces out the surface of a cone. 
That is, the tip of L traces out a horizontal circle. The radius of this circle is 
the horizontal component of L, which is Lcos9. The speed of the tip (that is, 
the magnitude of dL/dt ) is therefore (L cos 9)u, because L rotates around the 
vertical axis with the same frequency as the stick. So, dL/dt has magnitude 

I ^ I = (L cos 9)u = - m£ 2 u 2 sin 9 cos 9 , (8.32) 

l at l 3 

and it points into the page. 

Remark: In more complicated problems (where I y ^ 0), L will point in some messy 
direction (not along a principal axis), and the length of the horizontal component 
(that is, the radius of the circle L traces out) won’t be immediately obvious. In this 
case, you can either explicitly calculate the horizontal component (see the Gyroscope 
example in Section 8.7.5), or you can simply do things the formal (and easier) way by 

7 This is a better choice than the CM, because this way we won’t have to worry about any messy 
forces acting at the pivot, when computing the torque. 

8 However, see the third Remark, following this solution. 
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finding the rate of change of L via the expression dL/dt = uj x L (which holds for all 
the same reasons that w = dr/dt = oj x r holds). In the present problem, we obtain 

dL/dt = (wsin0,wcos0,O) x (m^w sin 0/3,0,0) = (0,0, — mt 2 oj 2 sin0cos0/3), 

(8.33) 

which agrees with eq. (8.32). And the direction is correct, since the negative z-axis 
points into the page. Note that we calculated this cross-product in the principal-axis 
basis. Although these axes are changing in time, they present a perfectly good set of 
basis vectors at any instant. Jit 

• Calculate the torque: The torque (relative to the pivot) is due to gravity, 
which effectively acts on the CM of the stick. So r = r x F has magnitude 

t = rFsinO = ( £/2)(mg ) sin0, (8.34) 

and it points into the page. 

• Equate r with dL/dt: The vectors dL/dt and r both point into the page 
(they had better point in the same direction). Equating their magnitudes gives 

mt 2 oj 2 sin 6 cos 0 _ mgl sin 0 

3 ““ 2 

=* “ - (8 ' 35) 

Remarks: 

1. This frequency is slightly larger than the frequency obtained if we instead have a mass 
at the end of a massless stick of length l. From Problem 12, the frequency in that 
case is \fgjlcos0. So, in some sense, a uniform stick of length t behaves like a mass 
at the end of a massless stick of length 2£/3, as far as these rotations are concerned. 

2. As 0 —> 7 t/2 , the frequency oj goes to oo, which makes sense. And as 0 —> 0, oj 
approaches y/3g/2£, which isn’t so obvious. 

3. As explained in Problem 2, the instantaneous oj is not uniquely defined in some 
situations. At the instant shown in Fig. 8.18, the stick is moving directly into the 
page. So let’s say someone else wants to think of the stick as (instantaneously) rotating 
around the axis oj' perpendicular to the stick (the x-axis, from above), instead of the 
vertical axis, as shown in Fig. 8.20. What is the angular speed a/? 

Well, if a; is the angular speed of the stick around the vertical axis, then we may view 
the tip of the stick as instantaneously moving in a circle of radius £ sin 0 around the 
vertical axis oj. So oj(£ sin 0) is the speed of the tip of the stick. But we may also 
view the tip of the stick as instantaneously moving in a circle of radius t around a/. 
The speed of the tip is still oj(£ sin 0), so the angular speed about this axis is given by 
<Jl = w(f sin 0). Hence u/ = oj sin 0, which is simply the x-component of oj that we 
found above, right before eq. (8.31). The moment of inertia around u/ is mt 2 / 3, so 
the angular momentum has magnitude (raf? 2 /3)(wsin0), in agreement with eq. (8.31). 
And the direction is along the x-axis, as it should be. 

Note that although oj is not uniquely defined at any instant, L = J(rxp) dm certainly 
is. 9 Choosing oj to point vertically, as we did in the above solution, is in some sense 
the natural choice, because this oj does not change with time. 4k 

9 The non-uniqueness of uj arises from the fact that I y = 0 here. If all the moments are nonzero, 
then (L x , L y , L z ) = (I x lj x , I y oj y , I z oj z ) uniquely determines uj, for a given L. 



Figure 8.20 
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8.5 Euler’s equations 

Consider a rigid body instantaneously rotating around an axis u. (u> may change 
as time goes on, but all we care about for now is what it is at a given instant.) The 
angular momentum, L, is given by eq. (8.8) as 

L = lu, (8.36) 

where I is the inertial tensor, calculated with respect to a given set of axes (and u> 
is written in the same basis, of course). 

As usual, things are much nicer if we use the principal axes (relative to the 
chosen origin) as the basis vectors of our coordinate system. Since these axes are 
fixed with respect to the rotating object, they will of course rotate with respect to 
the fixed reference frame. In this basis, L takes the nice form, 

L = (I\U)\, I 2 U 2 , 13 U 3 ), (8.37) 

where w\ t 0 J 2 , and W3 are the components of u) along the principal axes. In other 
words, if you take the vector L in space and project it onto the instantaneous 
principal axes, then you get these components. 

On one hand, writing L in terms of the rotating principal axes allows us to write 
it in the nice form of (8.37). But on the other hand, writing L in this way makes it 
nontrivial to determine how it changes in time (since the principal axes themselves 
are changing). The benefits outweigh the detriments, however, so we will invariably 
use the principal axes as our basis vectors. 

The goal of this section is to find an expression for dL/dt, and to then equate 
this with the torque. The result will be Euler’s equations, eqs. (8.43). 


Derivation of Euler’s equations 

If we write L in terms of the body frame, then we see that L can change (relative 
to the lab frame) due to two effects. L can change because its coordinates in the 
body frame may change, and L can also change because of the rotation of the body 
frame. 

To be precise, let Lo be the vector L at a given instant. At this instant, imagine 
painting the vector Lo onto the body frame (so that Lo will then rotate with the 
body frame). The rate of change of L with respect to the lab frame may be written 
in the (identically true) way, 


dL d( L — Lo) dLo 
dt dt dt 


The second term here is simply the rate of change of a body-fixed vector, which we 
know is u; x Lo (which equals wxLat this instant). The first term is the rate of 
change of L with respect to the body frame, which we will denote by SL/St. So we 
end up with 


dL 5 L 

- =-b U> 

dt St 


L. 


(8.39) 
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This is actually a general statement, true for any vector in any rotating frame . 10 
There is nothing particular to L that we used in the above derivation. Also, there 
was no need to restrict ourselves to principal axes. 

In words, what we’ve shown is that the total change equals the change relative 
to the rotating frame, plus the change of the rotating frame relative to the fixed 
frame. Simply addition of changes. 

Let us now be specific and choose our body-axes to be the principal-axes. This 
will put eq. (8.39) in a very usable form. Using eq. (8.37), we have 

= — (iiwi, I 2 W 2 , 73 W 3 ) + (wi, u> 2 , ^ 3 ) x (I 1 U 1 , 12 W 2 , 13 U 3 ). (8.40) 

at ot 

This equation equates two vectors. As is true for any vector, these (equal) 
vectors have an existence that is independent of what coordinate system we choose 
to describe them with (eq. (8.39) makes no reference to a coordinate system). But 
since we’ve chosen an explicit frame on the right-hand side of eq. (8.40), we should 
choose the same frame for the left-hand side; we can then equate the components on 
the left with the components on the right. Projecting dL/dt onto the instantaneous 
principal axes, we have 

(Or©,- (S 3 ) = i (/iwi ’ /M ’ j3 “ 3)+( " i '" 2 '“' 3}x (' ri “ i ’ te,:rstt ’ 3 )' 

(8.41) 

Remark: The left-hand side looks nastier than it really is. At the risk of belaboring 
the point, consider the following (this is a remark that has to be read very slowly): We could 
have written the left-hand side as (d/dt)(Li, L 2 , L 3 ), but this might cause confusion as to 
whether the L, : refer to the components with respect to the rotating axes, or the components 
with respect to the fixed set of axes that coincide with the rotating principal axes at this 
instant. That is, do we project L onto the principal axes, and then take the derivative; or 
do we take the derivative and then project? The latter is what we mean in eq. (8.41). (The 
former is dL/St, by definition.) The way we’ve written the left-hand side of eq. (8.41), it’s 
clear that we’re taking the derivative first. We are, after all, simply projecting eq. (8.39) 
onto the principal axes. £ 

The time derivatives on the right-hand side of eq. (8.41) are 5{I\uj\)/8t = 
hui (because I\ is constant), etc. Performing the cross product and equating the 
corresponding components on each side yields the three equations, 

^ = A'&l. + (h ~ h)w3UJ2, 

= h ^2 + {h ~ h)viuJ3, 

( df ) = "^ 3 ^ 3 ~ (8.42) 

10 We will prove eq. (8.39) in another more mathematical way in Chapter 9. 
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Figure 8.21 


If we have chosen the origin of our rotating frame to be either a fixed point or the 
CM (which we will always do), then the results of Section 7.4 tell us that we may 
equate dL/dt with the torque, r. We therefore have 

Ti = I\(T\ + (I3 — 12)1^3^2, 
r 2 = I2U2 + (h ~ 

73 = I 3 W 3 + (/ 2 — I\)u 2 W\. (8.43) 

These are Euler’s equations. You need only remember one of them, because the 
other two can be obtained by cyclic permutation of the indices. 

Remarks: 

1. We repeat that the left- and right-hand sides of eq. (8.43) are components that are 
measured with respect to the instantaneous principal axes. Let’s say we do a problem, 
for example, where at all times n = 7*2 = 0, and T 3 equals some nonzero number. 
This doesn’t mean, of course, that r is a constant vector. On the contrary, r always 
points along the X 3 vector in the rotating frame, but this vector is changing in the 
fixed frame (unless x 3 points along u>). 

The two types of terms on the right-hand sides of eqs. (8.42) are the two types of 
changes that L can undergo. L can change because its components with respect to 
the rotating frame change, and L can also change because the body is rotating around 

2. Section 8.6.1 on the free symmetric top (viewed from the body frame) provides a good 
example of the use of Euler’s equations. Another interesting application is the famed 
“tennis racket theorem” (Problem 14). 

3. It should be noted that you never have to use Euler’s equations. You can simply start 
from scratch and use eq. (8.39) each time you solve a problem. The point is that 
we’ve done the calculation of dL/dt once and for all, so you can just invoke the result 
in eqs. (8.43). & 

8.6 Free symmetric top 

The free symmetric top is the classic example of an application of the Euler equa¬ 
tions. Consider an object which has two of its principal moments equal (with the CM 
as the origin). Let the object be in outer space, far from any external forces . 11 We 
will choose our object to have cylindrical symmetry around some axis (see Fig. 8.21), 
although this is not necessary (a square cross-section, for example, would yield two 
equal moments). The principal axes are then the symmetry axis and any two or¬ 
thogonal axes in the cross-section plane through the CM. Let the symmetry axis be 
chosen as the X 3 axis. Then our moments are I\ = I 2 = I, and I 3 . 

8.6.1 View from body frame 

Plugging I\ = I 2 = I into Euler’s equations, eqs. (8.43), with the t % equal to zero 
(since there are no torques, because the top is “free”), gives 

0 = Id) 1 + (I 3 — I)u> sU> 2 , 

11 Equivalently, the object is thrown up in the air, and we are traveling along on the CM. 
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0 = Id) 2 + (I — 

0 = / 3 W 3 . (8.44) 

The last equation says that CP 3 is a constant. If we then define 

ft = cn 3 , (8.45) 

the first two equations become 

(hi + tlu> 2 = 0, and ciq — Huq = 0. (8.46) 

Taking the derivative of the first of these, and then using the second one to eliminate 
h 2 , gives 

chi + fiV = 0, (8.47) 

and likewise for u 2 . This is a nice simple-harmonic equation. The solutions for (t ) 

and (by using eq. (8.46)) u> 2 (t) are 

u>i(t) = Acos(Qt + <f>), U 2 (t) = Asin(fii + (/>). (8.48) 


Therefore, oq (t) and are the components of a circle in the body frame. Hence, 
the u> vector traces out a cone around x 3 (see Fig. 8.22), with frequency fi, as 
viewed by someone standing on the body. The angular momentum is 

L = (Rui, I 2 OJ 2 , I 3 W 3 ) = (^IAcos(SU + (j>), IAsm(Qt + (f>), I3W3), (8.49) 

so L also traces out a cone around x 3 (see Fig. 8.22), with frequency fi, as viewed 
by someone standing on the body. 

The frequency, Q. in eq. (8.45) depends on the value of U 3 and on the geometry 
of the object. But the amplitude, A, of the w cone is determined by the initial 
values of uq and uq- 

Note that 0 may be negative (if / > I 3 ). In this case, u> traces out its cone in 
the opposite direction compared to the H > 0 case. 


Example (The earth): Let’s consider the earth to be our object. Then aq ss 

27t/( 1 day). 12 The bulge at the equator (caused by the spinning of the earth) makes 
I 3 slightly larger than J, and it turns out that (73 — 7) /I & 1/300. Therefore, eq. 
(8.45) gives M (1/300) 27t/( 1 day). So the ui vector should precess around its cone 
once every 300 days, as viewed by someone on the earth. The true value is more like 
400 days. The difference has to do with various things, including the non-rigidity of 
the earth. But at least we got an answer in the right ballpark. 

How do you determine the direction of w? Simply make an extended-time photograph 
exposure at night. The stars will form arcs of circles. At the center of all these circles 
is a point that doesn’t move. This is the direction of us. 

12 This isn’t quite correct, since the earth rotates 366 times for every 365 days (due to the motion 
around the sun), but it’s close enough for the purposes here. 
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How big is the uj cone, for the earth? Equivalently, what is the value of A in eq. 
(8.48)? Observation has shown that uj pierces the earth at a point on the order of 
10 m from the north pole. Hence, A/uj 3 « (10 m )/R E . The half-angle of the uj cone 
is therefore found to be only on the order of 10 -4 degrees. So if you use an extended¬ 
time photograph exposure one night to see which point in the sky stands still, and 
then if you do the same thing 200 nights later, you probably won’t be able to tell that 
they’re really two different points. 


8.6.2 View from fixed frame 

Now let’s see what our symmetric top looks like from a fixed frame. In terms of the 
principal axes, xi,X 2 ,X 3 , we have 


uj = (uqxi + W 2 X 2 ) + n> 3 x 3 , and 
L = /(cnixi + W 2 X 2 ) + / 3 W 3 X 3 . (8.50) 


Eliminating the (uqxi + W 2 X 2 ) term from these equations gives (in terms of the LI 
defined in eq. (8.45)) 



S' fixed frame, 
Q>0 (/ 3 >/) 

Figure 8.23 


L 



Q<0 (/ 3 </) 


L = I(uj + flx 3 ), or uj = jL — 12 x 3 , (8.51) 

where L = |L|, and L is the unit vector in the L direction. The linear relationship 
between L, uj, and X3, implies that these three vectors lie in a plane. Since there are 
no torques on the system, L remains constant. Therefore, uj and X 3 precess (as we 
will see below) around L, with the three vectors always coplanar. See Fig. 8.23 for 
the case I 3 > I (an oblate top, such as a coin), and Fig. 8.24 for the case I 3 < / (a 
prolate top, such as a carrot). 

What is the frequency of this precession, as viewed from the fixed frame? The 
rate of change of X 3 is uj x X 3 (because £3 is fixed in the body frame, so its change 
comes only from rotation around uj). Therefore, eq. (8.51) gives 

^ (jL - x x 3 = ^ jLj x x 3 . (8.52) 

But this is simply the expression for the rate of change of a vector rotating around 
the fixed vector uj = (L/I) L. The frequency of this rotation is |u>| = L/I. Therefore, 
X 3 precesses around the fixed vector L with frequency 

u=j, (8.53) 

in the fixed frame (and therefore uj does also, since it is coplanar with X 3 and L). 


Figure 8.24 
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Remarks: 

1. We just said that ui precesses around L with frequency L/I. What, then, is wrong 
with the following reasoning: “Just as the rate of change of x 3 equals u> x x 3 , the rate 
of change of w should equal u> x u>, which is zero. Hence, u> should remain constant.” 
The error is that the vector u> is not fixed in the body frame. A vector A must be 
fixed in the body frame in order for its rate of change to be given by w x A. 

2. We found in eqs. (8.49) and (8.45) that a person standing on the rotating body sees 
L (and u>) precess with frequency II = w 3 (/ 3 — I)/I around x 3 . But we found in 
eq. (8.53) that a person standing in the fixed frame sees x 3 (and u>) precess with 
frequency L/I around L. Are these two facts compatible? Should we have obtained 
the same frequency from either point of view? (Answers: yes, no). 

These two frequencies are indeed consistent, as can be seen from the following rea¬ 
soning. Consider the plane (call it S) containing the three vectors L, u>, and x 3 . We 
know from eq. (8.49) that S rotates with frequency flx 3 with respect to the body. 
Therefore, the body rotates with frequency —flx 3 with respect to S. And from eq. 
(8.53), S rotates with frequency (L/I )L with respect to the fixed frame. Therefore, 
the total angular velocity of the body with respect to the fixed frame (using the frame 
S as an intermediate step) is 


^ total 


jt - Ox 3 . 


(8.54) 


But from eq. (8.51), this is simply u>, as it should be. So the two frequencies in eqs. 
(8.45) and (8.53) are indeed consistent. 

For the earth, Cl = w 3 (/ 3 — I)/I and L/I are much different. L/I is roughly equal 
to T// 3 , which is essentially equal to w 3 . Cl, on the other hand is about (l/300)w 3 . 
Basically, an external observer sees u> precess around its cone at roughly the rate at 
which the earth spins. But it’s not exactly the same rate, and this difference is what 
causes the earth-based observer to see u> precess with a nonzero Cl. X 


8.7 Heavy symmetric top 

Consider now a heavy symmetrical top; that is, one that spins on a table, under the 
influence of gravity (see Fig. 8.25). Assume that the tip of the top is fixed on the 
table by a free pivot. We will solve for the motion of the top in two different ways. 
The first will use r = dL/dt. The second will use the Lagrangian method. 

8.7.1 Euler angles 


*3 



Figure 8.25 


For both of these methods, it is very convenient to use the Euler angles, 9 which 
are shown in Fig. 8.26 and are defined as follows. 

• 9: Let x 3 be the symmetry axis of the top. Define 9 to be the angle that x 3 
makes with the vertical axis z of the fixed frame. 

• 4>: Draw the plane orthogonal to x 3 . Let 3cj be the intersection of this plane 
with the horizontal x-y plane. Define <p to be the angle xq makes with the x 
axis of the fixed frame. 


fixed point 
z in body 



Figure 8.26 
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• 'ip: Let X 2 be orthogonal to X 3 and xi, as shown. Let frame S be the frame 
whose axes are xi, £ 2 , and £ 3 . Define ip to be the angle of rotation of the 
body around the £3 axis in frame S. (That is, ^£3 is the angular velocity of 
the body with respect to S .) Note that the angular velocity of frame S with 
respect to the fixed frame is <j >z + 0 £ 1 . 

The angular velocity of the body with respect to the fixed frame is equal to the 
angular velocity of the body with respect to frame S, plus the angular velocity of 
frame S with respect to the fixed frame. In other words, it is 

uj = ipx 3 + (<j> z + 0xi). (8.55) 

Note that the vector z is not orthogonal to £1 and £ 3 . It is often more convenient 
to rewrite uj entirely in terms of the orthogonal xi,X 2 ,X 3 basis vectors. Since z = 
cos 0 x 3 + sin 0 X 2 , eq. (8.55) gives 

uj = (ip + cp cos 0 )x 3 + <f> sin 0 X 2 + 0£i. (8.56) 

This form of uj is often more useful, because £ 1 , £ 2 , and £3 are principal axes of 
the body. (We are assuming that we are working with a symmetrical top, with 
I\ = I 2 = I■ Hence, any axes in the X 1 -X 2 plane are principal axes.) Although £1 
and X 2 are not fixed in the object, they are still good principal axes at any instant. 

8.7.2 Digression on the components of uj 

The previous expressions for uj look rather formidable, but there is a very helpful 
diagram we can draw (see Fig. 8.27) which makes it easier to see what is going on. 
Let’s talk a bit about this before returning to the original problem of the spinning 
top. The diagram is rather pithy, so we’ll go through it nice and slowly. 

In the following discussion, we will simplify things by setting 0 = 0. All the 
interesting features of uj remain. The ftx 1 component of uj in eqs. (8.55) and (8.56) 
simply arises from the easily-visualizable rising and falling of the top. We will 
therefore concentrate here on the more complicated issues, namely the components 
of uj in the plane of £ 3 , z, and £ 2 . 

With 0 = 0, Fig. 8.27 shows the vector uj in the X 3 -Z-X 2 plane (the way we’ve 
drawn it, £1 points into the page, in contrast with Fig. 8.26). This is an extremely 
useful diagram, and we will refer to it many times in the problems for this chapter. 
There are numerous comments to be made on it, so let’s just list them out. 

1. If someone asks you to “decompose” uj into pieces along z and £ 3 , what would 
you do? Would you draw the lines perpendicular to these axes to obtain the 
lengths shown (which we will label as uj z and UJ3), or would you draw the lines 
parallel to these axes to obtain the lengths shown (which we will label as LI 
and uj')? There is no “correct” answer to this question. The four quantities, 
uj z , uJ;i , Q, uj' simply represent different things. We will interpret each of these 
below, along with uj 2 (the projection of uj along £ 2 ). It turns out that LI and 
uj' are the frequencies that your eye can see the easiest, while uj 2 and W3 are 
what you want to use when you’re doing calculations involving the angular 
momentum. (And as far as I can see, uj z is not of much use.) 
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Figure 8.27 


2. Note that it is true that 

u> = u/x 3 + flz, (8.57) 

but it is not true that u> = u z z + W 3 X 3 . Another true statement is 

u> = 0 ) 3 X 3 + W 2 X 2 . (8.58) 

3. In terms of the Euler angles, we see (by comparing eq. (8.57) with eq. (8.55), 
with 0 = 0 ) that 

J = 4 ’, 

0 = 4>. (8.59) 

And we also have (by comparing eq. (8.58) with eq. (8.56), with 0 = 0) 

(O 3 = V> + ^cos0 = u' + Ocos 0, 

U 2 = </>sin 0 = II sin 0 . (8.60) 

These are also clear from Fig. 8.27. 

There is therefore technically no need to introduce the new u> 2 , 0 ) 3 , O, 0 / 
definitions in Fig. 8.27, since the Euler angles are quite sufficient. But we will 
be referring to this figure many times, and it is a little easier to refer to these 
omega’s than to the various combinations of Euler angles. 

4. II is the easiest of these frequencies to visualize. It is simply the frequency 
of precession of the top around the vertical z axis . 13 In other words, the 

13 Although we’re using the same letter, this Q doesn’t have anything to do with the fi defined 
in eq. (8.45), except for the fact that they both represent a precession frequency. 
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symmetry axis £3 traces out a cone around the z axis with frequency LI. (Note 
that this precession frequency is not u z .) Let’s prove this. 

The vector u> is the vector which gives the speed of any point (at position r) 
fixed in the top aswxr. Therefore, since the vector £3 is fixed in the top, we 
may write 

^ = u) x £3 = (u/£ 3 + Liz) x £3 = (fiz) x £ 3 . (8.61) 

But this is precisely the expression for the rate of change of a vector rotating 
around the z axis, with frequency LI. (This was exactly the same type of proof 
as the one leading to eq. (8.52).) 

Remark: In the derivation of eq. (8.61), we’ve basically just stripped off a certain 
part of u> that points along the £3 axis, because a rotation around £3 contributes 
nothing to the motion of £ 3 . Note, however, that there is in fact an infinite number 
of ways to strip off a piece along £ 3 . For example, we can also break lo up as, say, 
u3 = W 3 X 3 + tv 2 x 2 . We then obtain dic 3 /dt = (w 2 x 2 ) x £ 3 , which means that x 3 is 
instantaneously rotating around £ 2 with frequency tv 2 . Although this is true, it is not 
as useful as the result in eq. (8.61), because the x 2 axis changes with time. The point 
here is that the instantaneous angular velocity vector around which the symmetry 
axis rotates is not well-defined (Problem 2 discusses this issue ). 14 But the z-axis is 
the only one of these angular velocity vectors that is fixed. When we look at the top, 
we therefore see it precessing around the z-axis. & 

5. uJ is also easy to visualize. Imagine that you are at rest in a frame that rotates 
around the z-axis with frequency LI. Then you will see the symmetry axis of the 
top remain perfectly still, and the only motion you will see is the top spinning 
around this axis with frequency u/. (This is true because uj = u /£3 + Liz, and 
the rotation of your frame causes you to not see the Liz part.) If you paint a 
dot somewhere on the top, then the dot will trace out a fixed tilted circle, and 
the dot will return to, say, its maximum height at frequency uJ. 

Note that someone in the lab frame will see the dot undergo a rather com¬ 
plicated motion, but she must observe the same frequency at which the dot 
returns to its highest point. Hence, u' is something quite physical in the lab 
frame, also. 

6 . 0 J 3 is what you use to obtain the component of L along £ 3 , because L 3 = / 3 W 3 . 
It is not quite as easy to visualize as II and u', but it is the frequency with 
which the top instantaneously rotates, as seen by someone at rest in a frame 
that rotates around the £2 axis with frequency w 2 . (This is true because 
u = w 2 £2 + ^£ 3 , and the rotation of the frame causes you to not see the w 2 £ 2 
part.) This rotation is a little harder to see, because the £ 2 axis changes with 
time. 

14 The instantaneous anguiar velocity of the whole body is well defined, of course. But if you just 
look at the symmetry axis by itself, then there is an ambiguity (see footnote 9). 
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There is one physical scenario in which u>3 is the easily observed frequency. 
Imagine that the top is precessing around the z axis at constant 6 , and imagine 
that the top has a frictionless rod protruding along its symmetry axis. If 
you grab the rod and stop the precession motion (so that the top is now 
spinning around its stationary symmetry axis), then this spinning will occur 
at frequency UJ3 . This is true because when you grab the rod, you apply a 
torque in only the (negative) £2 direction. Therefore, you don’t change L3, 
and hence you don’t change 023. 

7 . U2 is similar to 013, of course. U2 is what you use to obtain the component 
of L along £2, because L2 = h^2- It is the frequency with which the top 
instantaneously rotates, as seen by someone at rest in a frame that rotates 
around the £3 axis with frequency L03. (This is true because u) = ^£2 + 223X3, 
and the rotation of the frame causes you to not see the 023£3 part.) Again, 
this rotation is a little harder to see, because the £3 axis changes with time. 

8. u z is not very useful (as far as I can see). The most important thing to note 
about it is that it is not the frequency of precession around the z-axis, even 
though it is the projection of u> onto z. The frequency of the precession is fi, 
as we found above in eq. ( 8 . 61 ). A true, but somewhat useless, fact about 
u z is that if someone is at rest in a frame that rotates around the z axis with 
frequency u> z , then she will see all points in the top instantaneously rotating 
around the £-axis with frequency uj x , where oj x is the projection of u> onto the 
horizontal £ axis. (This is true because u> = c o x it + lu z z. and the rotation of 
the frame causes you to not see the u> z z part.) 

8.7.3 Torque method 

This method of solving the heavy top will be straightforward, although a little 
tedious. We include it here to ( 1 ) show that this problem can be done without 
resorting to Lagrangians, and to ( 2 ) get some practice using t = dL/dt. 

We will make use of the form of a; given in eq. ( 8 . 56 ), because there it is broken 
up into the principal-axis components. For convenience, define /3 = ip + 0cos 9. so 
that 

u = /L£ 3 + </>sin6t£2 + 0£i. ( 8 . 62 ) 

Note that we’ve returned to the most general motion, where 0 is not necessarily 
zero. 

We will choose the tip of the top as our origin, which is assumed to be fixed on 
the table. 15 Let the principal moments relative to this origin be I\ = I2 = /, and 
I3. The angular momentum of the body is then 

L = I3I3-X.3 + /</>sinft£ 2 + I&k\. ( 8 . 63 ) 

We must now calculate dE/dt. What makes this nontrivial is the fact that the 
£1, £2, and £3 unit vectors change with time (they change with 9 and (f>). But let’s 

15 We could use the CM as our origin, but then we would have to include the complicated forces 
acting at the pivot point, which is difficult. 
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forge ahead and take the derivative of eq. (8.63). Using the product rule (which 
works fine with the product of a scalar and a vector), we have 


dL d/3 „ d((psinO) ^ dO „ 

it = /3 ia X3+/ ^«— X2+/ * X > 

+h AN + i^e d N + K-' 


dt 

Using a little geometry, you can show 
rix : . 

dt 
dx 2 


dt 


—Ox. 2 + </>sin 0 xi, 


dt 


(8.64) 


dxi 


(8.65) 


As an exercise, prove these by making use of Fig. 8.26. In the first equation, for 
example, show that a change in 0 causes £3 to move a certain distance in the x 2 
direction; and show that a change in <f> causes £3 to move a certain distance in the 
£1 direction. Plugging eqs. (8.65) into eq. (8.64) gives 


— = J 3 ^£ 3 +(l^ sin 0 + 2I0j> cos 0 - I 3 0O)* 2 

+(l0 ~ I<j) 2 smOcosO + I 3 /30sin6^£i. ( 8 . 66 ) 


The torque on the top arises from gravity pulling down on the CM. r points 
in the £1 direction and has magnitude Mgl sin 0, where I is the distance from the 
pivot to CM. Equating r with dL/dt gives 


ir °, 


(8.67) 


for the £3 component. Therefore, $ is a constant, which we will call uj 3 (an obvious 
label, in view of eq. (8.62)). The other two components of r = dL/dt then give 


I4)smO + 0(2I<j>cos6-I 3 uj3 ) = 0, 

(Mgl + Icf) 2 cos 6 — hu 3 (f)) sin0 = 10. ( 8 . 68 ) 


We will wait to fiddle with these equations until we have derived them again using 
the Lagrangian method. 


8.7.4 Lagrangian method 

Eq. (8.13) gives the kinetic energy of the top as T = ■ L. Eqs. (8.62) and (8.63) 

give (using ^ + </>cos 0 instead of the shorthand /?) 16 

T=^u-L= l -h(i> + j>cos0) 2 + ^I(j> 2 sin 2 0 + 0 2 ). (8.69) 

16 It was ok to use /3 in Subsection 8.7.3; we introduced it simply because it was quicker to write. 
But we can’t use 6 here, because it depends on the other coordinates, and the Lagrangian method 
requires the use of independent coordinates. (The variational proof back in Chapter 5 assumed this 
independence.) 
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The potential energy is 

V = MglcosO, (8.70) 

where I is the distance from the pivot to CM. The Lagrangian is £ = T—V (we’ll use 
"C here to avoid confusion with the angular momentum, “L”), and so the equation 
of motion obtained from varying ip is 


id dC _ dC 
dt dip dip 


(8.71) 


Therefore, ip + <pcos9 is a constant. Call it u 3 . The equations of motion obtained 
from varying (p and 9 are then 

d dC dC d ( • ■ 2 ^ 

Itef=ai =*• lA hu 3cos9 + W s,„ e)=o, 

‘'p - 'Pi —" Id=(Mgt + I4> 2 <x>sd-huz<l>)mid. (8.72) 

These are equivalent to eqs. ( 8 . 68 ), as you can check. Note that there are two 
conserved quantities, arising from the facts that dC/dip and d£/d<p equal zero. The 
conserved quantities are simply the angular momenta in the X 3 and z directions, 
respectively. (There is no torque in the plane spanned by these vectors, since the 
torque points in the xq direction.) 


8.7.5 Gyroscope with 9 = 0 

A special case of eqs. ( 8 . 68 ) occurs when 9 = 0. In this case, the first of eqs. ( 8 . 68 ) 
says that 0 is a constant. The CM of the top therefore undergoes uniform circular 
motion in a horizontal plane. Let 0 = cp be the frequency of this motion (this is the 
same notation as in eq. (8.59)). Then the second of eqs. ( 8 . 68 ) says that 

m 2 cos 9 - I 3 u 3 n + Mgi = 0. (8.73) 

This quadratic equation may be solved to yield two possible precessional frequencies 
for the top. (Yes, there are indeed two of them, provided that u 3 is greater than a 
certain minimum value.) 

The previous pages in this “Heavy Symmetric Top” section have been a bit 
abstract. So let’s now pause for a moment, take a breather, and rederive eq. (8.73) 
from scratch. That is, we’ll assume 9 = 0 from the start of the solution, and solve 
things by simply finding L and using r = dL/dt , in the spirit of Section 8.4.2. 

The following Gyroscope example is the classic “top” problem. We’ll warm up 
by solving it in an approximate way. Then we’ll do it for real. 


Example (Gyroscope): A symmetric top of mass M has its CM a distance I from 
its pivot. The moments of inertia relative to the pivot are I\= I 2 =I and I 3 . The top 
spins around its symmetry axis with frequency cv 3 (in the language of Section 8.7.2), 
and initial conditions have been set up so that the CM precesses in a circle around 
the vertical axis. The symmetry axis makes a constant angle 9 with the vertical (see 
Fig. 8.28). 



L 



Figure 8.29 
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(a) Assuming that the angular momentum due to uj 3 is much larger than any other 
angular momentum in the problem, find an approximate expression for the fre¬ 
quency, Cl, of precession. 

(b) Now do the problem exactly. That is, find Cl by considering all of the angular 
momentum. 


Solution: 


(a) The angular momentum (relative to the pivot) due to the spinning of the top 
has magnitude L 3 = I 3 U 3 , and it is directed along x 3 . Let’s label this angular 
momentum vector as L 3 = L3X3 . As the top precesses, L 3 traces out a cone 
around the vertical axis. So the tip of L 3 moves in a circle of radius L :i sin 9. The 
frequency of this circular motion is the frequency of precession, Cl. So dL 3 /dt, 
which is the velocity of the tip, has magnitude 

Cl(L 3 sin 9 ) = J 3 W 3 sin 9, (8.74) 


and is directed into the page. 

The torque relative to the pivot point is due to gravity acting on the CM, so 
it has magnitude Mgl sin 9. It is directed into the page. Therefore, t = dL/dt 
gives 


(8.75) 


Note that this is independent of 9. And it is inversely proportional to u) 3 . 


(b) The error in the above analysis is that we omitted the angular momentum arising 
from the X 2 (defined in Section 8.7.1) component of the angular velocity due to 
the precession of the top around the z-axis. This component has magnitude 
flsin0 . 17 The angular momentum due to this angular velocity component has 
magnitude 

L 2 = IClsm9, (8.76) 


and is directed along X 2 . Let’s label this as L 2 — 7 . 2 X 2 - The total L = L 2 + L 3 
is shown in Fig. 8.29. 

Only the horizontal component of L (call it Lj_) changes. From the figure, Lj_ is 
the difference in lengths of the horizontal components of L 3 and L 2 . Therefore, 


L _l = L 3 sin9 — L 2 cos9 = I 3 cj 3 sm 9 — IClsin9cos9. (8.77) 


The magnitude of the rate of change of L is simply CIL± = Cl{I.iUj 3 sin 6 — 
I Cl sin 9 cos 0 ). 18 Equating this with the torque, Mgl sin 9, gives 

ICl 2 cos 9 - I 3 U 3 CI + Mgl = 0, (8.78) 


in agreement with eq. (8.73), as we wanted to show. The quadratic formula 
quickly gives the two solutions for Cl, which may be written as 


Cl ± 


I 3 LO 3 (, , r, AM I gl cos 9 \ 

21 cos 9 \ L± \ Il<4 ] 


(8.79) 


17 The angular velocity due to the precession is Cli. We may break this up into components 
along the orthogonal directions X2 and X3. The Cl cos 9 component along X3 was absorbed into the 
definition of 013 (see Fig. 8.27). 

18 This result can also be obtained in a more formal way. Since L precesses with angular velocity 
Clz., the rate of change of L is dL/dt = Cli x L. This cross product is easily computed in the X2-X3 
basis, and gives the same result. 
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Remark: Note that if 0 = 7r/2, then eq. (8.78) is actually a linear equation, so there 
is only one solution for Q, which is the one in eq. (8.75). L2 points vertically, so it 
doesn’t change. Only L3 contributes to dL/dt. For this reason, a gyroscope is much 
easier to deal with when its symmetry axis is horizontal. & 


The two solutions in eq. (8.79) are known as the fast-precession and slow- 
precession frequencies. For large tu 3 , you can show that the slow-precession 


frequency is 




Mgi 

I3U3 


(8.80) 


in agreement with the solution found in eq. ( 8 .75 ). 19 This task, along with 
many other interesting features of this problem (including the interpretation of 
the fast-precession frequency, f2+), is the subject of Problem 16, which you are 
strongly encouraged to do. 


8.7.6 Nutation 

Let us now solve eqs. ( 8 . 68 ) in a somewhat more general case, where 6 is allowed to 
vary slightly. That is, we will consider a slight perturbation to the circular motion 
associated with eq. (8.73). We will assume W 3 is large here, and we will assume that 
the original circular motion corresponds to the slow precession, so that <f is small. 
Under these assumptions, we will find that the top will bounce around slightly as it 
travels (roughly) in a circle. This bouncing is known as nutation. 

Since 0 and f are small, we can (to a good approximation) ignore the quadratic 
terms in eqs. ( 8 . 68 ) and obtain 

Iif sin 6 — 0 / 3 W 3 = 0 , 

(Mgl - sin 9 = 16. (8.81) 

We must somehow solve these equations for 6 {t) and <f(t). Taking the derivative 
of the first equation and dropping the quadratic term gives 9 = (I sin d/TjWy) dPf/dt 2 . 
Substituting this into the second equation gives 

+ u n^~^s) = 0 , (8.82) 

where 

V n = h( p and U, - (8.83) 

1 1 3W3 

are, respectively, the frequency of nutation (as we shall soon see), and the slow- 
precession frequency given in eq. (8.75). Shifting variables to y = f — U s in eq. 
(8.82) gives us a nice harmonic-oscillator equation. Solving this and then shifting 
back to if yields 

<f(t) = + Acos(u> n t + 7 ), (8.84) 

19 This is fairly clear. If 0J3 is large enough compared to O, then we can ignore the first term in eq. 
(8.78). That is, we can ignore the effects of L2, which is exactly what we did in the approximate 
solution in part (a). 
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where A and 7 are determined from initial conditions. Integrating this gives 

0(f) = Ll s t + ^ sin(u; n f + 7 ), (8.85) 

plus an irrelevant constant. 

Now let’s solve for 9(t). Plugging 0(f) into the first of eqs. (8.81) gives 

9(t) = — A Un sm(o; n f + 7 ) = —A sin 9 sin(u; n f + 7 ). ( 8 . 86 ) 

Since 9(t) doesn’t change much, we may set sin 9 s=s sin 9 0 . where 9o is, say, the 
initial value of 9(t). (Any errors here are second-order effects in small quantities.) 
Integration then gives 

9(t) = B + sincos (u> n t + 7 ), (8.87) 

where B is a constant of integration. 

Eqs. (8.85) and (8.87) show that both 0 (neglecting the uniform Ll s t part) and 9 
oscillate with frequency u n , and with amplitudes inversely proportional to uj n . Note 
that eq. (8.83) says uj n grows with 0 * 3 . 


Example (Sideways kick): Assume that uniform circular precession is initially 
taking place with 6 = 9 0 and 0 = f l s . You then give the top a quick kick along 
the direction of motion, so that 0 is now equal to fl s + Af l (Af l may be positive or 
negative). Find 0(f) and 9(t). 

Solution: This is simply an exercise in initial conditions. We are given the initial 
values for 0, 9, and 9. So we will need to solve for the unknowns A, B and 7 in eqs. 
(8.84), ( 8 . 86 ), and (8.87). 9 is initially zero, so eq. ( 8 . 86 ) gives 7 = 0 . And 0 is 
initially f l s + All, so eq. (8.84) gives A = All Finally, 9 is initially 9 0 , so eq. (8.87) 
gives B = 9o — (Af l/iv n ) sin 0 O . Putting it all together, we have 

0 (f) = fl s t + sinuijjt, 

9(t) = ^9o — —— sin0 o ^ + — sin 6 >o^ cosw^t. ( 8 . 88 ) 

And for future reference (in the problems for this chapter), we’ll also list the deriva¬ 
tives, 

0(f) = + AficOSWnt, 

9(t) = — Afl sin 9o sin uj n t. (8.89) 


Remarks: 

(a) With the initial conditions we have chosen, eq. (8.88) shows that 9 always stays on one 
side of 0o■ If Af1 > 0, then 9(t) < 9o (that is, the top is always at a higher position, 
since 6 is measured from the vertical). If Afl < 0, then 9(t) > 60 (that is, the top is 
always at a lower position). 
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(b) The sin 0 O coefficient of the cos a i n t term in eq. (8.88) implies that the amplitude of 
the 0 oscillation is sin Oo times the amplitude of the 0 oscillation. This is precisely the 
factor needed to make the CM travel in a circle around its average precessing position 
(because a change in 6 causes a displacement of l dO, whereas a change in 0 causes a 
displacement of t sin 0 o d(f>). £ 
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8.8 Exercises 

Section 8.1: Preliminaries concerning rotations 

1. Rolling wheel * 

A wheel with spokes rolls on the ground. A stationary camera takes a picture 
of it, from the side. Do to the nonzero exposure time of the camera, the spokes 
will generally appear blurred. At what location (locations) in the picture does 
(do) a spoke (the spokes) not appear blurred? 

Section 8.2: The inertia tensor 

2. Inertia tensor * 

Calculate the rx(wxr) double cross-product in eq. (8.7) by using the vector 
identity, 

A x (B x C) = (A • C)B - (A • B)C. (8.90) 


Section 8.3: Principal axes 

3. Tennis racket theorem ** 

Problem 14 gives the statement of the “tennis racket theorem,” and the solu¬ 
tion there involves Euler’s equations. 

Demonstrate the theorem here by using conservation of L 2 and conservation 
of rotational kinetic energy in the following way. Produce an equation which 
says that if u 2 and U 3 (or u>\ and 0 J 2 ) start small, then they must remain small. 
And produce the analogous equation which says that if uq and u 3 start small, 
then they need not remain small . 20 

Section 8 . 4 : Two basic types of problems 

4. Rotating axle ** 

Two wheels (with moment of inertia I) are connected by a massless axle of 
length l , as shown in Fig. 8.30. The system rests on a frictionless surface, 
and the wheels rotate with frequency u around the axle. Additionally, the 
whole system rotates with frequency LI around the vertical axis through the 
center of the axle. What is largest value of LI for which both wheels stay on 
the ground? 

Section 8.7: Heavy symmetric top 

5. Relation between D and u/ ** 

Initial conditions have been set up so that a symmetric top undergoes preces¬ 
sion, with its symmetry axis always making an angle 0 with the vertical. The 
top has mass M, and the principal moments are Is and I = I\ = I 2 - The CM 

20 It’s another matter to show that they actually won’t remain small. But don’t bother with that 
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is a distance i from the pivot. In the language of Fig. 8.27, show that oj' is 
related to II by 


Mgl 


(I~h\ 


(8.91) 


Note: You could just plug u >3 = u / + It cos 9 (from eq. (8.60)) into eq. (8.73), 
and then solve for J. But solve this problem from scratch, using r = dL/dt. 

6. Sliding lollipop *** 

Consider a lollipop made of a solid sphere of mass to and radius r, which 
is radially pierced by massless stick. The free end of the stick is pivoted on 
the ground, which is frictionless (see Fig. 8.31). The sphere slides along the 
ground (keeping the same contact point on the sphere), with its center moving 
in a circle of radius R, with frequency fh 

Show that the normal force between the ground and the sphere is j% = 
mg + mrLl 2 (which is independent of R). Solve this by: 



Figure 8.31 


(a) Using a simple F = ma argument. 21 

(b) Using a (more complicated) r = dE/dt argument. 

7. Rolling wheel and axle *** 

A massless axle has one end attached to a wheel (which is a uniform disc of 
mass to and radius r), with the other end pivoted on the ground (see Fig. 8.32). 
The wheel rolls on the ground without slipping, with the axle inclined at an 
angle 9. The point of contact with the ground traces out a circle with frequency 

n. 



Figure 8.32 


(a) Show that u) points horizontally to the right (at the instant shown), with 
magnitude u = Q/ tan 9. 

(b) Show that the normal force between the ground and the wheel is 

N = mg cos 2 9 + mrfl 2 ^ cos 3 9 + ^ cos 9 sin 2 9^j . (8.92) 


21 This method happens to work here, due to the unusually nice nature of the sphere’s motion. 
For more general motion (for example, in Problem 21, where the sphere is spinning), you must use 
f = dL/dt. 





Figure 8.33 
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8.9 Problems 

Section 8.1: Preliminaries concerning rotations 

1. Fixed points on a sphere ** 

Consider a transformation of a rigid sphere into itself. Show that two points 
on the sphere end up where they started. 

2. Many different co’s * 

Consider a particle at the point (a, 0,0), with velocity (0, u,0). This particle 
may be considered to be rotating around many different u> vectors passing 
through the origin. (There is no one “correct” u>.) Find all the possible u>’s. 
(That is, find their directions and magnitudes.) 

3. Rolling cone ** 

A cone rolls without slipping on a table. The half-angle at the vertex is a, 
and the axis of the cone has length h (see Fig. 8.33). Let the speed of the 
center of the base (label this as point P) be v. What is the angular velocity 
of the cone with respect to the lab frame (at the instant shown)? 

There are many ways to do this problem, so you are encouraged to take a look 
at the three given solutions, even if you solve it. 

Section 8.2: The inertia tensor 

4. Parallel-axis theorem 

Let (X,Y,Z) be the position of an object’s CM, and let (x',y',z') be the 
position relative to the CM. Prove the parallel-axis theorem, eq. (8.17), by 
setting x = X + x', y = Y + y', and z = Z + z' in eq. (8.8). 

Section 8.3: Principal axes 

5. Existence of principal axes for a pancake * 

Given a pancake object in the x-y plane, show that there exist principal axes 
by considering what happens to the integral / xy as the coordinate axes are 
rotated about the origin. 

6. Symmetries and principal axes for a pancake ** 

A rotation of the axes in the x-y plane through an angle 0 transforms the 
coordinates according to 

( COS * ^ ) ( 1 ) . (8.93) 

l — sin 0 cos 9 I \ V I 

Use this to show that if a pancake object in the x-y plane has a symmetry 
under a rotation through 9 ^ 7 r, then all axes (through the origin) in the plane 
are principal axes. 
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7. Rotating square * 

Here’s an exercise in geometry. Theorem 8.5 says that if the moments of inertia 
of two principal axes are equal, then any axis in the plane of these axes is a 
principal axis. This means that the object will rotate happily about any axis 
in this plane (no torque is needed). Demonstrate this explicitly for four masses 
m in the shape of a square (which obviously has two moments equal), with 
the CM as the origin (see Fig. 8.34). Assume that the masses are connected 
with strings to the axis, as shown. Your task is to show that the tensions in 
the strings are such that there is no torque about the center of the square. 

8. A nice cylinder * 

What must the ratio of height to diameter of a cylinder be so that every axis 
is a principal axis (with the CM as the origin)? 

Section 8.4 ■' Two basic types of problems 

9. Rotating rectangle * 

A flat uniform rectangle with sides of length a and b sits it space (not rotating). 
You strike the corners at the ends of one diagonal, with equal and opposite 
forces (see Fig. 8.35). Show that the resulting initial u> points along the other 
diagonal. 

10. Rotating stick ** 

A stick of mass m and length i spins with frequency u around an axis, as 
shown in Fig. 8.36. The stick makes an angle 9 with the axis and is pivoted 
at its center. It is kept in this motion by two strings which are perpendicular 
to the axis. What is the tension in the strings? 

11. Another rotating stick ** 

A stick of mass m and length i is arranged to have its CM motionless and its 
top end slide in a circle on a frictionless rail (see Fig. 8.37). The stick makes 
an angle 9 with the vertical. What is the frequency of this motion? 

12. Spherical pendulum ** 

Consider a pendulum made of a massless rod of length £ and a point mass m. 
Assume conditions have been set up so that the mass moves in a horizontal 
circle. Let 9 be the constant angle the rod makes with the vertical. Find the 
frequency, H, of this circular motion in three different ways. 

(a) Use F = ma. (The net force accounts for the centripetal acceleration.) 22 

(b) Use t = dL/dt with the pendulum pivot as the origin. 

(c) Use t = dL/dt with the mass as the origin. 

22 This method works only if you have a point mass. With an extended object, you have to use 
one of the following methods involving torque. 



Figure 8.34 



Figure 8.35 



Figure 8.36 



Figure 8.37 
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Figure 8.38 



Figure 8.39 


13. Rolling in a cone ** 

(a) A fixed cone stands on its tip, with its axis in the vertical direction. The 
half-angle at the vertex is 6 . A particle of negligible size slides on the 
inside frictionless surface of the cone (see Fig. 8.38). 

Assume conditions have been set up so that the particle moves in a circle 
at height h above the tip. What is the frequency, LI, of this circular 
motion? 

(b) Assume now that the surface has friction, and a small ring of radius r 
rolls without slipping on the surface. Assume conditions have been set 
up so that ( 1 ) the point of contact between the ring and the cone moves 
in a circle at height h above the tip, and ( 2 ) the plane of the ring is at 
all times perpendicular to the line joining the point of contact and the 
tip of the cone (see Fig. 8.39). 

What is the frequency, LI, of this circular motion? (You may work in 
the approximation where r is much less than the radius of the circular 
motion, htanO.) 



Figure 8.40 


Section 8.5: Euler’s equations 

14. Tennis racket theorem *** 

If you try to spin a tennis racket (or a book, etc.) around any of its three 
principal axes, you will find that different things happen with the different 
axes. Assuming that the principal moments (relative to the CM) are labeled 
according to 1% > I 2 > I 3 (see Fig. 8.40), you will find that the racket will 
spin nicely around the xi and X 3 axes, but it will wobble in a rather messy 
manner if you try to spin it around the X 2 axis. 

Verify this claim experimentally with a book (preferably lightweight, and 
wrapped with a rubber band), or a tennis racket (if you happen to study 
with one on hand). 

Verify this claim mathematically. The main point here is that you can’t start 
the motion off with u> pointing exactly along a principal axis. Therefore, what 
you want to show is that the motion around the xi and X 3 axes is stable (that 
is, small errors in the initial conditions remain small); whereas the motion 
around the X 2 axis is unstable (that is, small errors in the initial conditions 
get larger and larger, until the motion eventually doesn’t resemble rotation 
around the £2 axis ). 23 Your task is to use Euler’s equations to prove these 
statements about stability. (Exercise 3 gives another derivation of this result.) 


23 If you try for a long enough time, you will eventually be able to get the initial w pointing close 
enough to X2 so that the book will remain rotating (almost) around X2 for the entire time of its 
flight. There is, however, probably a better use for your time, as well as for the book... 
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Section 8 . 6 : Free symmetric top 

15. Free-top angles * 

In Section 8.6.2, we showed that for a free symmetric top, the angular momen¬ 
tum L, the angular velocity uj, and the symmetry axis £3 all lie in a plane. 
Let a be the angle between X 3 and L, and let fJ be the angle between X 3 and 
u (see Fig. 8.41). Find the relationship between a and (3 in terms of the 
principal moments, / and I 3 . 

Section 8.7: Heavy symmetric top 

16. Gyroscope ** 

This problem deals with the gyroscope example in Section 8.7.5, and uses the 
result for fi in eq. (8.79). 


L 

(0 



Figure 8.41 


(a) What is the minimum 103 for which circular precession is possible? 

(b) Let CJ 3 be very large, and find approximate expressions for fl±. The 
phrase “very large” is rather meaningless, however. What mathematical 
statement should replace it? 

17. Many gyroscopes ** 

N identical plates and massless sticks are arranged as shown in Fig. 8.42. 
Each plate is glued to the stick on its left. And each plate is attached by a 
free pivot to the stick on its right. (And the leftmost stick is attached by a 
free pivot to a pole.) You wish to set up a circular precession with the sticks 
always forming a straight horizontal line. What should the relative angular 
speeds of the plates be so that this is possible? 

18. Heavy top on slippery table * 

Solve the problem of a heavy symmetric top spinning on a frictionless table 
(see Fig. 8.43). You may do this by simply stating what modifications are 
needed in the derivation in Section 8.7.3 (or Section 8.7.4). 

19. Fixed highest point ** 

Consider a top made of a uniform disc of radius R, connected to the origin by 
a massless stick (which is perpendicular to the disc) of length l. Paint a dot 
on the top at its highest point, and label this as point P (see Fig. 8.44). You 
wish to set up uniform circular precession, with the stick making a constant 
angle 6 with the vertical, and with P always being the highest point on the 
top. What relation between R and i must be satisfied for this motion to be 
possible? What is the frequency of precession, ffi 


C3 



Figure 8.42 



Figure 8.43 



Figure 8.44 
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Figure 8.45 



Figure 8.46 



Figure 8.47 



Figure 8.48 


20. Basketball on rim *** 

A basketball rolls without slipping around a basketball rim in such a way that 
the contact points trace out a great circle on the ball, and the CM moves 
around in a horizontal circle with frequency LI. The radii of the ball and rim 
are r and R, respectively, and the ball’s radius to the contact point makes an 
angle 9 with the horizontal (see Fig. 8.45). Assume that the ball’s moment 
of inertia around its center is I = (2/3) mr 2 . Find LI. 

21. Rolling lollipop *** 

Consider a lollipop made of a solid sphere of mass m and radius r, which is 
radially pierced by massless stick. The free end of the stick is pivoted on the 
ground (see Fig. 8.46). The sphere rolls on the ground without slipping, with 
its center moving in a circle of radius R, with frequency Ll. 

(a) Find the angular velocity vector, u. 

(b) What is the normal force between the ground and the sphere? 

22. Rolling coin *** 

Initial conditions have been set up so that a coin of radius r rolls around in a 
circle, as shown in Fig. 8.47. The contact point on the ground traces out a 
circle of radius R, and the coin makes a constant angle 9 with the horizontal. 
The coin rolls without slipping. (Assume that the friction with the ground is 
as large as needed.) What is the frequency of the circular motion of the contact 
point on the ground? Show that such motion exists only if R > (5/6)r cos 9. 

23. Wobbling coin **** 

If you spin a coin around a vertical diameter on a table, it will slowly lose 
energy and begin a wobbling motion. The angle between the coin and the 
table will decrease, and eventually the coin will come to rest. Assume that 
this process is slow, and consider the motion when the coin makes an angle 9 
with the table (see Fig. 8.48). You may assume that the CM is essentially 
motionless. Let R be the radius of the coin, and let 12 be the frequency at 
which the point of contact on the table traces out its circle. Assume that the 
coin rolls without slipping. 


(a) Show that the angular velocity vector of the coin is = Ll sin 0x 2 , where 
X 2 points upward along the coin, directly away from the contact point 
(see Fig. 8.27). 


(b) Show that 


n = 2 


rr 

V Rsin9 


(8.94) 


(c) Show that Abe (or Tom, Franklin, George, John, Dwight, Sue, or Saca- 
gawea) appears to rotate, when viewed from above, with frequency 


I 9 

V -Rsin# 


2(1 — cos 9) 


(8.95) 
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24. Nutation cusps ** 

(a) Using the notation and initial conditions of the example in Section 8.7.6, 
prove that kinks occur in nutation if and only if AU = ±U S . (A kink is 
where the plot of 9(t) vs. r/>(t) has a discontinuity in its slope.) 

(b) Prove that these kinks are in fact cusps. (A cusp is a kink where the plot 
reverses direction in the 4>-6 plane.) 

25. Nutation circles ** 

(a) Using the notation and initial conditions of the example in Section 8.7.6, 

and assuming u 3 AU U s , find (approximately) the direction of the 

angular momentum right after the sideways kick takes place. 

(b) Use eqs. ( 8 . 88 ) to then show that the CM travels (approximately) in 
a circle around L. And show that this “circular” motion is just what 
you would expect from the reasoning in Section 8.6.2 (in particular, eq. 
(8.53)), concerning the free top. 

Additional problems 

26. Rolling straight? ** 

In some situations, for example the rolling-coin setup in Problem 22, the ve¬ 
locity of the CM of a rolling object changes direction as time goes by. Consider 
now a uniform sphere that rolls on the ground without slipping. Is it possible 
for the velocity of its CM to change direction? Justify your answer rigorously. 

27. Ball on paper *** 

A ball rolls without slipping on a table. It rolls onto a piece of paper. You 
slide the paper around in an arbitrary (horizontal) manner. (It’s fine if there 
are abrupt, jerky motions, so that the ball slips with respect to the paper.) 
After you allow the ball to come off the paper, it will eventually resume rolling 
without slipping on the table. Show that the final velocity equals the initial 
velocity. 

28. Ball on turntable **** 

A ball with uniform mass density rolls without slipping on a turntable. Show 
that the ball moves in a circle (as viewed from the inertial lab frame), with a 
frequency equal to 2/7 times the frequency of the turntable. 



moves farther 
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8.10 Solutions 

1. Fixed points on a sphere 

First solution: For the purposes of Theorem 8.1, we only need to show that two 
points remain fixed for an infinitesimal transformation. But since it’s possible to 
prove this result for a general transformation, we’ll consider the general case here. 
Consider the point A that ends up farthest away from where it started. (If there is 
more than one such point, pick any one of them.) Label the ending point B. Draw the 
great circle, Cab , through A and B. Draw the great circle, Ca, that is perpendicular 
to Cab at A; and draw the great circle, Cb, that is perpendicular to Cab at B. 

We claim that the transformation must take Ca to Cb- This is true for the following 
reason. The image of Ca is certainly a great circle through B, and this great circle 
must be perpendicular to Cab, because otherwise there would exist another point 
that ended up farther away from its starting point than A did (see Fig. 8.49). Since 
there is only one great circle through B that is perpendicular to Cab, the image of 
Ca must in fact be Cb- 

Now consider the two points, Pi and P 2 , where Ca and Cb intersect. (Any two great 
circles must intersect.) Let’s look at Pi. The distances Pi A and P\ B are equal. 
Therefore, the point Pi is not moved by the transformation. This is true because Pi 
ends up on Cb (because Cb is the image of Ca, which is where Pi started), and if it 
ends up at a point other than Pi, then its final distance from B would be different 
from its initial distance from A. This would contradict the fact that distances are 
preserved on a rigid sphere. Likewise for Pj. 

Note that for a non-infinitesimal transformation, every point on the sphere may move 
at some time during the transformation. What we just showed is that two of the 
points end up back where they started. 

Second solution: In the spirit of the above solution, we can give simpler solution, 
but which is valid only in the case of infinitesimal transformations. 

Pick any point, A, that moves during the transformation. Draw the great circle that 
passes through A and is perpendicular to the direction of A’s motion. (Note that this 
direction is well-defined, because we are considering an infinitesimal transformation.) 
All points on this great circle must move (if they move at all) perpendicularly to the 
great circle, because otherwise their distances to A would change. But they cannot all 
move in the same direction, because then the center of the sphere would move (but it 
is assumed to remain fixed). Therefore, at least one point on the great circle moves in 
the direction opposite to the direction in which A moves. Therefore (by continuity), 
some point (and hence also its diametrically opposite point) on the great circle must 
remain fixed. 

2. Many different u5’s 

We want to find all the vectors, u>, such that u ; x ax = vy. Since u> is orthogonal to 
this cross product, u> must lie in the x-z plane. We claim that if w makes an angle 
6 with the a;-axis, and has magnitude v/(a sin0 ), then it will satisfy u x ax = vy. 
Indeed, 

u> x ax = |a>||ax| sin0y = vy. (8.96) 

Alternatively, note that 00 may be written as 

“=^ <c<s( ’' 0 - si,,e) =(^' 0 ' D' 


( 8 . 97 ) 
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and only the ^-component is relevant in the cross product with ax. 

It makes sense that the magnitude of o* is v/(a sin 9), because the particle is traveling 
in a circle of radius a sin# around o*, at speed v. 

A few possible o*’s are drawn in Fig. 8.50. Technically, it is possible to have 7r < 
9 < 2n, but then the v/(asm9) coefficient in eq. (8.97) is negative, so o* really points 
upward in the x-z plane, (o* must point upward if the particle’s velocity is to be in 
the positive y-direction.) Note that ui z is independent of 9, so all the possible o*’s 
look like those in Fig. 8.51. 

For 9 = 7t/2, we have o* = v/a, which makes sense. If 9 is very small, then a* is very 
large. This makes sense, because the particle is traveling around in a very small circle 
at the given speed v. 


Remark: The point of this problem is that the particle may actually be in the process of 
having its position vector trace out a cone around one of many possible axes (or perhaps 
it may be undergoing some other complicated motion). If we are handed only the given 
initial information on position and velocity, then it is impossible to determine which of these 
motions is happening. And it is likewise impossible to uniquely determine a*. (This is true 
for a collection of points that lie on at most one line through the origin. If the points, along 
with the origin, span more than a 1-D line, then a* is in fact uniquely determined.) Jft 

3. Rolling cone 

At the risk of overdoing it, we’ll present three solutions. The second and third solu¬ 
tions are the type that tend to make your head hurt, so you may want to reread them 
after studying the discussion on the angular velocity vector in Section 8.7.2. 


First solution: Without doing any calculations, we know that o* points along the 
line of contact of the cone with the table, because these are the points on the cone 
that are instantaneously at rest. And we know that as time goes by, o* rotates around 
in the horizontal plane with angular speed v/(hcosa), because P travels at speed v 
in a circle of radius h cos a around the 2 -axis. 

The magnitude of o* can be found as follows. At a given instant, P may be considered 
to be rotating in a circle of radius d = hsina around o*. (see Fig. 8.52). Since P 
moves with speed v, the angular speed of this rotation is v/d. Therefore, 


(8.98) 


Second solution: We can use Theorem 8.3 with the following frames. 8 % is fixed 
in the cone; S 3 is the lab frame; and S 2 is the frame that (instantaneously) rotates 
around the tilted 0 * 2,3 axis shown in Fig. 8.53, at the speed such that the axis of the 
cone remains fixed in it. (The tip of 0 * 2,3 will trace out a circle as it precesses around 
the z-axis, so after the cone moves a little, we will need to use a new S 2 frame. But 
at any moment, S 2 instantaneously rotates around the axis perpendicular to the axis 
of the cone.) In the language of Theorem 8.3, 0 * 1,2 and 0 * 2,3 point in the directions 
shown. We must find their magnitudes and then add the vectors to find the angular 
velocity of Si with respect to S 3 . 

First, we have 

Ksl = (8-99) 

because point P moves (instantaneously) with speed v in a circle of radius h around 
0 * 2 , 3 . 



Figure 8.50 


Figure 8.51 



Figure 8.53 
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®2,3 = Vlh 



Figure 8.54 



▼ ®2,3 

Figure 8.55 



Figure 8.56 


We now claim that 

K 2 | = (8.100) 

where r = h tan a is the radius of the base of the cone. This is true because someone 
fixed in S 2 will see the endpoint of the radius (the one drawn) moving “backwards” 
at speed v, because it is stationary with respect to the table. Hence, the cone must 
be spinning with frequency v/r in S-2- 

The addition of Wi,2 and 0*2,3 is shown in Fig. 8.54. The result has magnitude 
v/(hsm.a), and it points horizontally (because |o*2,3|/|o*i, 2 | = tana). 


Third solution: We can use Theorem 8.3 with the following frames. Si is fixed 
in the cone; and S 3 is the lab frame (as in the second solution). But now let S 2 be 
the frame that rotates around the (negative) z- axis, at the speed such that the axis 
of the cone remains fixed in it. (Note that we can keep using this same S 2 frame as 
time goes by, unlike the S 2 frame in the second solution.) 0*1,2 and 0*2,3 point in the 
directions shown in Fig. 8.55. As above, we must find their magnitudes and then 
add the vectors to find the angular velocity of S± with respect to S3. 


1 ^ 2,3 | = 


( 8 . 101 ) 


because point P moves with speed v in a circle of radius hcosa around 0*2,3 


It’s a little tricker to find |o*i,21- Consider the circle of contact points on the table 
where the base of the cone touches it. This circle has a radius h/ cos a. From the 
point of view of someone spinning around with S 2 , the table rotates backwards with 
frequency |o* 2 ,3 1 = v/(hcosa), so this person sees the circle of contact points move 
with speed v/ cos 2 a around the vertical. Since there is no slipping, the contact point 
on the cone must also move with this speed around the axis of the cone (which is 
fixed in S 2 ). And since the radius of the base is r, this means that the cone rotates 
with angular speed v/(rcos 2 a) with respect to Si. Therefore, 


|wi, 2 | 


V 

r cos 2 a 


h sin- 


icosa 


( 8 . 102 ) 


The addition of u*i >2 and 0*2,3 is shown in Fig. 8.56. The result has magnitude 
v/(hsma), and it points horizontally (because |u* 2> 3|/|u* 1)2 | = sin a). 

4. Parallel-axis theorem 

Consider one of the diagonal entries in I, say In = f(y 2 + z 2 ). In terms of the new 
variables, this equals 

I11 = J ((Y + y') 2 T t/Z 4- z') 2 ) 

= j(Y 2 + Z 2 ) + j(y ' 2 + z' 2 ) 

= M(Y 2 + Z 2 )+ J (y' 2 + z' 2 ), (8.103) 

where we have used the fact that the cross terms vanish because, for example, fYy' = 
Y f y' = 0, by definition of the CM. 

Similarly, consider an off-diagonal entry in I, say I12 = — / xy. We have 


--/<* 


+ x')(Y + y') 
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- -J XY ~J xV 

= -M(XY) - j x'y', (8.104) 

where the cross terms have likewise vanished. We therefore see that all of the terms 
in I take the form of those in eq. (8.17), as desired. 

5. Existence of principal axes for a pancake 

For a pancake object, the inertia tensor I takes the form in eq. (8.8), with z = 0. 
Therefore, if we can find a set of axes for which f xy = 0, then I will be diagonal, and 
we will have found our principal axes. We can prove, using a continuity argument, 
that such a set of axes exists. 

Pick a set of axes, and write down the integral f xy = I 0 . If J 0 = 0, then we are done. 
If Io ^ 0, then rotate these axes by an angle tt/2, so that the new x is the old y, and 
the new y is the old —x (see Fig. 8.57). Write down the new integral / xy = 4/ 2 . 
Since the new and old coordinates are related by x ne w = Void and y new = —*01(15 we 
have 4/2 = — Io- Therefore, since / xy switched sign during the rotation of the axes, 
there must exist some intermediate angle for which the integral f xy is zero. 

6. Symmetries and principal axes for a pancake 

First Solution: In view of the form of the inertia tensor given in eq. (8.8), we want 
to show that if a pancake object has a symmetry under a rotation through 9 ^ n, 
then f xy = 0 for any set of axes (through the origin). 

Take an arbitrary set of axes and rotate them through an angle 9 ^ n. The new 
coordinates are x' = (x cos 9 + y sin 9) and y' = (— x sin 9 + y cos 9 ), so the new matrix 
entries, in terms of the old ones, are 

4 = J x n = I xx cos 2 9 + 2 I xy sin 9 cos 9 + I yy sin 2 9, 

lyy = j v' 2 = Lx sin 2 9 — 2 I xy sin 9 cos 9 + I yy cos 2 9, 

l' xy = J x'y' = —I xx sm9cos9 + I X y(cos 2 9 — sm 2 9) + I yy sm9cos9.(8.105) 

If the object looks exactly like it did before the rotation, then I' xx = I xx , I' yy = I yy , 
and I xy = I xy . The first two of these are actually equivalent statements, so we’ll just 
use the first and third. Using cos 2 9 — 1 = — sin 2 9, these give 



Figure 8.57 


0 = -I xx sin 2 9 + 2I xy sm9 cos 9 + I yy sin 2 9, 

0 = — I xx sin 0 cos 9 — 2I xy sin 2 9 + I yy sin 0 cos 9. (8.106) 

Multiplying the first of these by cos 9 and the second by sin 9, and subtracting, gives 
2I xy sin# = 0. (8.107) 

Under the assumption 9 ^ n (and 9 ^ 0, of course), we must therefore have I xy = 0. 
Our initial axes were arbitrary; hence, any set of axes (through the origin) in the 
plane is a set of principal axes. 

Remark: If you don’t trust this result, then you may want to show explicitly that the 
moments around two orthogonal axes are equal for, say, an equilateral triangle centered at 
the origin (which implies, by Theorem 8.5, all axes in the plane are principal axes). & 
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Figure 8.58 



Second Solution: If an object is invariant under a rotation through an angle 0, 
then 6 must be of the form 0 = 2n/N, for some integer N (convince yourself of this). 24 
Consider a regular A-gon with “radius” R, with point-masses m located at the ver¬ 
tices. Any object that is invariant under a rotation through 6 = 2n/N can be con¬ 
sidered to be built out of regular point-mass A-gons of various sizes. Therefore, if we 
can show that any axis in the plane of a regular point-mass A-gon is a principal axis, 
then we’re done. We can do this as follows. 

In Fig. 8.58, let </> be the angle between the axis and the nearest mass to its right. 
Label the A masses clockwise from 0 to A — 1, starting with this one. Then the angle 
between the axis and mass k is <f> + 2Trk/N. And the distance from mass k to the axis 
is r-fc = |i?sin(0 + 2-jrk/N)\. 

The moment of inertia around the axis is ft = rnft . In view of Theorem 8.5, if 

we can show that ft = ft> for some <f> A ft (with <fi A ft+n), then we have shown that 
every axis is a principal axis. We will do this by demonstrating that ft, is independent 
of <j). We’ll use a nice math trick here, which involves writing a trig function as the 
real part of a complex exponential. We have 


mR 2 sin 2 U+ 

k =0 ' ' 

mR 2 A / 47rfc\\ 

fe=0 v 7 

Nm & - m ^ 2 Re (eW+^/ N ^ 


—Re (e 2i ^ (l + e 4ni / N + e & ™/ N + • • • + e 4 < 


NmR 2 mR 2 ^ ( 2ifli ( e iN * i/N - 1 \ 

( e ( e 4 *i/N _ 1 ) 


1 ) 7 ri/N^ 

(8.108) 


where we have summed the geometric series to obtain the last line. The numerator 
in the parentheses equals e 4m — 1=0. And if A ^ 2, the denominator is not zero. 
Therefore, if A ^ 2 (which is equivalent to the 6 A tt restriction), then 


h 


NmR 2 
2 


(8.109) 


which is independent of ft Hence, the moments around all axes in the plane are equal, 
so every axis in the plane is a principal axis, by Theorem 8.5 


Remarks: Given that the moments around all the axes in the plane are equal, they must 
be equal to NmR 2 /2, because the perpendicular-axis theorem says that they all must be 
one-half of the moment around the axis perpendicular to the plane (which is NmR 2 ). Jl> 

7. Rotating square 

Label two of the masses A and B, as shown in Fig. 8.59. Let l a be the distance along 

24 If N is divisible by 4, then a quick application of Theorem 8.5 shows that any axis in the plane 
is a principal axis. But if N isn’t divisible by 4, this isn’t so obvious. 


Figure 8.59 
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the axis from the CM to M’s string, and let r a be the length of M’s string. Likewise 
for B. The force, Fa, in M’s string must account for the centripetal acceleration of 
M. Hence, Fa = mrAW 2 . The torque around the CM due to Fa is therefore 

ta = mrAlAU 2 - ( 8 . 110 ) 

Likewise, the torque around the CM due to B’s string is tb = mr/M/jCc' 2 , in the 
opposite direction. 

But the two shaded triangles in Fig. 8.59 are congruent (they have the same hy¬ 
potenuse and the same angle 9). Therefore, I a = tb and Ib = r a ■ Hence, ta = tb, 
and the torques cancel. The torques from the other two masses likewise cancel. (Note 
that a uniform square is made up of many sets of these squares of point masses, so 
we’ve also shown that no torque is needed for a uniform square.) 

Remark: For a general IV-gon of point masses, Problem 6 shows that any axis in the 
plane is a principal axis. We should be able to use the above torque argument to prove 
this. This can be done as follows. (It’s time for a nice math trick, involving the imaginary 
part of a complex exponential.) Using eq. (8.110), we see that the torque from mass 
M in Fig. 8.60 is ta = muj 2 R 2 sin 0 cos 0. Likewise, the torque from mass B is tb = 
mu! 2 R 2 sin(0+27r/N) cos(0+27r/lV), and so on. The total torque around the CM is therefore 



= 0 , ( 8 . 111 ) 

provided that IV/ 2 (but it’s hard to have a 2-gon, anyway). This was essentially another 
proof of Problem 6. To prove that the torque was zero (which is one of the definitions of 
a principal axis), we showed here that = 0. In terms of the chosen axes, this is 

equivalent to showing that ^xy = 0, that is, showing that the off-diagonal terms in the 
inertia tensor vanish (which is simply another definition of the principal axes). & 

8. A nice cylinder 

Three axes that are certainly principal axes are the symmetry axis and two orthogonal 
diameters. The moments around the latter two are equal (call them I). Therefore, 
by Theorem 8.5, if the moment around the symmetry axis also equals /, then every 
axis is a principal axis. 

Let the mass of the cylinder be M. Let its radius be R and its height be h. Then the 
moment around the symmetry axis is MB?/ 2. 

Let D be a diameter through the CM. The moment around D can be calculated as 
follows. Slice the cylinder into horizontal disks of thickness dy. Let p be the mass 
per unit height (so p = M/h). The mass of each disk is then p dy, so the moment 
around a diameter through the disk is (pdy)R 2 / 4. Therefore, by the parallel-axis 



Figure 8.60 
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Figure 8.62 


theorem, the moment of a disk at height y (where —h /2 < y < /i/2) around D is 
(pdy)R 2 /4 + ( pdy)y 2 . Hence, the moment of the entire cylinder around D is 


I= f h/ 2 (pj?_ 

J-h/2 V 4 


pR 2 h ph 3 _ MR 2 


PV dy= —— + — 


We want this to equal MR?/ 2. Therefore, 

h = V3R. 


( 8 . 112 ) 


(8.113) 


You can show that if the origin was instead taken to be the center of one of the circular 
faces, then the answer would be h = V3R/2. 

9. Rotating rectangle 

If the force is out of the page at the upper left corner and into the page at the 
lower right corner, then the torque r = r x F points upward to the right, as shown 
in Fig. 8.61, with r oc ( b,a ). The angular momentum equals / r dt. Therefore, 
immediately after the strike, L is proportional to (b,a). 

The principal moments are I x = mb 2 /12 and I y = ma 2 / 12. The angular momentum 
may be written as L = ( I x u > x , I y u y ). Therefore, since we know L oc (6, a), we have 

oc (1 Pj oc (A, oc M), (8.114) 

which is the direction of the other diagonal. This answer checks in the special case 
a = b, and also in the limit where either a or b goes to zero. 

10. Rotating stick 

The angular momentum around the CM may be found as follows. Break u> up into 
its components along the principal axes of the stick (which are parallel and perpen¬ 
dicular to the stick). The moment of inertia around the stick is zero. Therefore, to 
compute L, we need to know only the component of u perpendicular to the stick. 
This component is cu sin 6 , and the associated moment of inertia is ml 2 / 12. Hence, 
the angular momentum at any time has magnitude 

L = j^mi 2 uj sin#, (8.115) 

and it points as shown in Fig. 8.62. The tip of the vector L traces out a circle 
in a horizontal plane, with frequency u>. The radius of this circle is the horizontal 
component of L, which is L± = L cos 0. The rate of change of L therefore has 
magnitude 

= u>L± = ujLcosO = tu cos0, (8.116) 

and it is directed into the page at the instant shown. 

Let the tension in the strings be T. Then the torque due to the strings is r = 
2T(I/2)cos6, directed into the page at the instant shown. Therefore, r = dL/dt 
gives 

T£cos0 = u^mt 2 u sin e'jcose, (8.117) 

and so 

T = ^-m£u 2 sin6. ( 8 . 118 ) 


dL 

dt 
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Remarks: For 9 —> 0, this goes to zero, which makes sense. For 9 —> n/2, it goes to the 
finite value miJ 1 /12, which isn’t entirely obvious. 

Note that if we instead had a massless stick with equal masses of m/2 on the ends (so that 
the relevant moment of inertia is now ml 2 / 4), then our answer would be T = mtui 2 sin 0/4. 
This makes sense if we write it as T n=s (m/2)(£sin0/2)o; 2 , because each tension is simply 
responsible for keeping a mass of m/2 moving in a circle of radius {i/2) sin 9 at frequency u>. 

* 

11. Another rotating stick 

As in Problem 10, the angular momentum around the CM may be found by breaking 
u> up into its components along the principal axes of the stick (which are parallel and 
perpendicular to the stick). The moment of inertia around the stick is zero. Therefore, 
to compute L, we need to know only the component of u) perpendicular to the stick. 
This component is u>sin0, and the associated moment of inertia is mi 2 / 12. Hence, 
the angular momentum at any time has magnitude 

L = ^-m£ 2 ujsm6, (8.119) 


and it points as shown in Fig. 8.63. The change in L comes from the horizontal 
component. This has length LcosO and travels in a circle at frequency w. Hence, 
\dL/dt\ = u)L cos 9, and it is directed into the page at the instant shown. 

The torque around the CM has magnitude mg{I/ 2) sin 9, and it points into the page 
at the instant shown. (This torque arises from the vertical force from the rail. There 
is no horizontal force from the rail, because the CM does not move.) Therefore, 
t = dL/dt gives 

mqismO (ml 2 u>sm6\ 

^ — w ( — ) cosff, (8.120) 


and so 



( 8 . 121 ) 


| mg 



Figure 8.63 


Remarks: For 9 — * 7t/2, this goes to infinity, which makes sense. For 9 — * 0, it goes to the 
constant \J&g/i, which isn’t so obvious. 

The motion in this problem is not possible if the bottom end of the stick, instead of the 
top end, slides along a rail. The magnitudes of all quantities are the same as in the original 
problem, but the direction of the torque (as you can check) is in the wrong direction. 

If we instead had a massless stick with equal masses of m/2 on the ends (so that the relevant 
moment of inertia is now mi 2 / 4), then our answer would be u> = y/2g/{icos9). This is 
simply the oj = \Jg/[{t/2 ) cos 9] answer for a point-mass spherical pendulum of length 1/2 
(see Problem 12), because the middle of the stick is motionless. 

Note that the original massive stick cannot be treated like two sticks of length t/2. That is, 
the answer in eq. (8.121) is not obtained by using i/2 for the length in eq. (8.35). This is 
because there are internal forces in the stick that provide torques; if a free pivot were placed 
at the CM of the stick in this problem, the stick would not remain straight. X 

12. Spherical pendulum 

(a) The forces on the mass are gravity and the tension from the rod (see Fig. 8.64). 
Since there is no vertical acceleration, we have T cos 6 = mg. The unbalanced 



-- J mg 

Figure 8.64 







-^ J mg 

Figure 8.65 

w/Q 2 sin0 | m £ 



Figure 8.66 
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horizontal force from the tension is therefore T sin 9 = mg tan 9. This force 
accounts for the centripetal acceleration, m(£sm9)U 2 . Hence, 


Q = 


I 9 
V £ COS 9 


( 8 . 122 ) 


Remark: For 9 « 0, this is the same as the \fgji frequency for a simple pendulum. 
For 9 « 7r/2, it goes to infinity, which makes sense. Note that 9 must be less than 7r/2 
for circular motion to be possible. (This restriction does not hold for a gyroscope with 
extended mass.) Jit 

(b) The only force that applies a torque relative to the pivot is the gravitational 
force. The torque is r = rngt sin 0, directed into the page (see Fig. 8.65). 

At this instant in time, the mass has a speed (£sin9)Q, directed into the page. 
Therefore, L^rxp has magnitude rnUll sin 0, and is directed upward to the 
right, as shown. 

The tip of L traces out a circle of radius Lcos9, at frequency 0. Therefore, 
dL/dt has magnitude IlL cos 0, and is directed into the page. 

Hence, r = dL/dt gives rngl sin 0 = sind) cos 6 . This yields eq. (8.122). 

(c) The only force that applies a torque relative to the mass is that from the pivot. 
There are two components to this force (see Fig. 8.66). 

There is the vertical piece, which is mg. Relative to the mass, this provides a 
torque of mg(£ sin 9), which is directed into the page. 

There is also the horizontal piece, which accounts for the centripetal acceleration 
of the mass. This equals m(£sm9)U 2 . Relative to the mass, this provides a 
torque of m£ 0. 2 sin 9 (£ cos 0 ), which is directed out of the page. 

Relative to the mass, there is no angular momentum. Therefore, dL/dt = 0. 
Hence, there must be no torque; so the above two torques cancel. This implies 
that mg(£ sin#) = m£Q 2 sm9(£cos0), which yields eq. (8.122). 

Remark: In problems that are more complicated than this one, it is often easier to 
work with a fixed pivot as the origin (if there is one) instead of the CM, because then 
you don’t have to worry about messy pivot forces contributing to the torque. Jit 

13. Rolling in a cone 

(a) The forces on the particle are gravity (mg) and the normal force ( N ) from the 
cone. Since there is no net force in the vertical direction, we have 

N sm.9 = mg. (8.123) 

The inward horizontal force is therefore N cos 9 = mg/ tan 9. This force accounts 
for the centripetal acceleration of the particle moving in a circle of radius h tan 9. 
Hence, mg/ tan 9 = m(/i tan #)fl 2 , and so 



(b) The forces on the ring are gravity (mg), the normal force (N) from the cone, 
and a friction force (F) pointing up along the cone. Since there is no net force 
in the vertical direction, we have 


N sin 9 + F cos 0 = mg. 


(8.125) 
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The fact that the inward horizontal force accounts for the centripetal accelera¬ 
tion yields 

NcosO — Fsind = m(htanO)fl 2 . (8.126) 

The previous two equations may be solved to yield F. The result is 

F = mg cos 9 — mfl 2 (h tan 6 ) sin 9. (8.127) 

The torque on the ring (relative to the CM) is due solely to this F (because 
gravity provides no torque, and N points though the center of the ring, by the 
second assumption in the problem). Therefore, r = rF equals 

r = r(mgcos 0 — mfl 2 (htan 0 ) sin#). (8.128) 


and t points horizontally. 

We must now find dL/dt. Since are assuming r <C /i tan 0, the frequency of the 
spinning of the ring (call it w) is much greater than the frequency of precession, 
fi. We will therefore neglect the latter in finding L. In this approximation, L is 
simply mr 2 u>, and L points upward (or downward, depending on the direction of 
the precession) along the cone. The horizontal component of L has magnitude 
L i = L sin 0, and it traces out a circle at frequency fi. Therefore, dL/dt has 
magnitude 

= UL± = ULsm9 = Cl(mr 2 uj)smd, (8.129) 

and it points horizontally, in the same direction as r. 

The non-slipping condition is ru) = (ft tan 0)0, 25 which gives u) = (htaxiO)Q/r. 
Using this in eq. (8.129) yields 

= Cl 2 mrh tan 9 sin 6 , (8.130) 

Equating this |dL/dt| with the torque in eq. (8.128) gives 


dL 

dt 


dL 

dt 


(8.131) 


This frequency is l/\/2 times the frequency found in part (a). 

Remark: If you consider an object with moment of inertia rjmr 2 (our ring has t) = 1), 
then you can show by the above reasoning that the “2” in eq. (8.131) is simply replaced 
by (1 + n). * 

14. Tennis racket theorem 

Presumably the experiment worked out as it was supposed to, without too much harm 
to the book. Now let’s demonstrate the result mathematically. 


Rotation around xi: If the racket is rotated (nearly) around the xi axis, then the 
initial o>2 and 1U3 are much smaller than uq. To emphasize this, relabel uq —> €2 and 

25 This is technically not quite correct, for the same reason that the earth spins around 366 times 
instead of 365 times in a year. But it’s valid enough in the limit of small r. 
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u) :i £ 3 . Then eqs. (8.43) become (with the torque equal to zero, because only 
gravity acts on the racket) 


0 = 

Cj\ - Ae 2 £3, 


0 = 

£2 + Buj i£ 3 , 


0 = 

£3 — Cu>iC 2 , 

(8.132) 

where we have defined (for convenience 

) 


A^ 1 ^, B = 

. h ~ h ^ c _! i~h 

(8.133) 


Note that A, B, and C are all positive (this fact will be very important). 

Our goal here is to show that if the e’s start out small, then they remain small. 
Assuming that they are small (which is true initially), the first equation says that 
chi ~ 0 (to first order in the e’s). Therefore, we may assume that u>\ is essentially 
constant (when the e’s are small). Taking the derivative of the second equation then 
gives 0 = €2 + BujNs- Plugging the value of £3 from the third equation into this yields 

e 2 = ~(BCul)e 2 . (8.134) 

Because of the negative coefficient on the right-hand side, this equation describes 
simple harmonic motion. Therefore, £2 oscillates sinusoidally around zero. Hence, if 
it starts small, it remains small. By the same reasoning, £3 remains small. 

We therefore see that w « (oq,0,0) at all times, which implies that L « {Lui -\, 0,0) 
at all times. That is, L always points (nearly) along the xq direction (which is fixed 
in the racket frame). But the direction of L is fixed in the lab frame (because there 
is no torque). Therefore, the direction of xq must also be (nearly) fixed in the lab 
frame. In other words, the racket doesn’t wobble. 

Rotation around xq: The calculation goes through exactly as above, except with “1” 
and “3” interchanged. We find that if ei and £2 start small, they remain small. And 
u> « (0,0,^) at all times. 

Rotation around x 2 : If the racket is rotated (nearly) around the x 2 axis, then the ini¬ 
tial ui-i and U 3 are much smaller than u> 2 - As above, let’s emphasize this by relabeling 
u)-i —* £1 and 0 J 3 —> £3. Then as above, eqs. (8.43) become 

0 = £1 — Aai2^3i 

0 = u >2 + Be i£ 3 , 

0 = £3-Cw 2 ei, (8.135) 

Our goal here is to show that if the e’s start out small, then they do not remain 
small. Assuming that they are small (which is true initially), the second equation 
says that cl>2 ~ 0 (to first order in the e’s). So we may assume that is essentially 
constant (when the e’s are small). Taking the derivative of the first equation then 
gives 0 = e'i — Auj 263 . Plugging the value of £3 from the third equation into this yields 

e*i = (ACu)l)e v (8.136) 

Because of the positive coefficient on the right-hand side, this equation describes 

an exponentially growing motion, instead of an oscillatory one. Therefore, £1 grows 
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quickly from its initial small value. Hence, even if it starts small, it becomes large. 
By the same reasoning, e 3 becomes large. (Of course, once the e’s become large, then 
our assumption of W2 ~ 0 isn’t valid anymore. But once the e’s become large, we’ve 
shown what we wanted to.) 

We see that w does not remain (nearly) equal to (0, W2.0) at all times, which implies 
that L does not remain (nearly) equal to (0,/2n>2,0) at all times. That is, L does not 
always point (nearly) along the X2 direction (which is fixed in the racket frame). But 
the direction of L is fixed in the lab frame (because there is no torque). Therefore, 
the direction of x2 must change in the lab frame. In other words, the racket wobbles. 
15. Free-top angles 

In terms of the principal axes, x-| ,x 2 ,x 3 , we have 


= (uqxi + W2X2) + UI3X3, and 
L = /(uqxi + W2X2) + 73^3X3. (8.137) 


Let (uqxi + oj 2 x 2 ) = be the component of lo orthogonal to w 3 . Then, by 

definition, we have 


Therefore, 


tan/3 = 


W_L 

U 3 


and 


tana 


Ilo _l 
I3OJ3 


tana _ I 
tan /3 I 3 


(8.138) 

(8.139) 


If I > I 3 , then a > (3, and we have the situation shown in Fig. 8.67. A top with this 
property is called a “prolate top”. An example is a football or a pencil. 

If I < I 3 , then a < /3, and we have the situation shown in Fig. 8.68. A top with this 
property is called an “oblate top”. An example is a coin or a Frisbee. 


16. Gyroscope 


(a) In order for there to exist real solutions for ft in eq. (8.79), the discriminant must 
be non-negative. If 9 > n/2, then cos 9 < 0, so the discriminant is automatically 
positive, and any value of u > 3 is allowed. But if 9 < n/2, then the lower limit on 


is 


u>3 > 


sJ^Mlgi cos 9 

h 


= W3. 


(8.140) 


Note that at this critical value, eq. (8.79) gives 


n. = n 


I3W3 
21 cos 9 



(8.141) 


(b) Since u >3 has units, “large 0 J 3 " is a meaningless description. What we really 
mean is that the fraction in the square root in eq. (8.79) is very small compared 
to 1. That is, e = (4M Igl cos 9) /(7|w|) <C 1. In this case, we may use \/l — c « 
1 - e/2 + • • • to write 


n ± 


I3U3 


m 1 


2MIg£cos9 


))■ 


' 21 cos 9 V V /fcuf 
Therefore, the two solutions for ft are (to leading order in W3) 


(8.142) 




I cos 9 ’ 


and 


7 3 W3 


L 



Figure 8.67 


L 



Figure 8.68 


(8.143) 
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These are known as the “fast” and “slow” frequencies of precession, respectively. 
12_ is the approximate answer we found in eq. (8.75), and it was obtained here 
under the assumption e<Cl, which is equivalent to 


u>3 ^ 


y/AMIgl cos 6 

h 


(that is, W3 


^3). 


(8.144) 


This, therefore, is the condition for the result in eq. (8.75) to be a good ap¬ 
proximation. Note that if I is of the same order as I 3 (so that they are both of 
the order Ml 2 ), and if cos 6 is of order 1, then this condition may be written as 
uj » \/g/l, which is the frequency of a pendulum of length £. 



Figure 8.69 



Figure 8.70 


Remarks: The 12+ solution is a fairly surprising result. Two strange features of 12+ 
are that it grows with +3, and that it is independent of g. To see what is going on 
with this precession, note that 12+ is the value of 12 that makes the L± in eq. (8.77) 
essentially equal to zero. So L points nearly along the vertical axis. The rate of change 
of L is the product of a very small radius (of the circle the tip traces out) and a very 
large 12 (if we’ve picked +3 to be large). The product of these equals the “medium 
sized” torque Mglsm.6. 

In the limit of large UJ3 , the fast precession should look basically like the motion of a 
free top (because L is essentially constant), discussed in Section 8.6.2. And indeed, 
12+ is independent of g. We’ll leave it to you to show that 12+ « L/I, which is the 
precession frequency of a free top (eq. (8.53)), as viewed from a fixed frame. 

We can plot the 12+ of eq. (8.79) as functions of W3. With the definitions of +3 and 
12 q in eqs. (8.140) and (8.141), we can rewrite eq. (8.79) as 



It is easier to work with dimensionless quantities, so let’s rewrite this as 

y± = x ± x 1 — 1, with y± = , x = ^. 

‘ 2 0 0J3 

A rough plot of y± vs. x is shown in Fig. 8.69. & 


(8.145) 


(8.146) 


17. Many gyroscopes 

The system is made up of N rigid bodies, each consisting of a plate and the massless 
stick glued to it on its left (see Fig. 8.70). Label these sub-systems as Si, with Si 
being the one closest to the pole. 

Let each plate have mass m and moment of inertia 7, and let each stick have length 
l. Let the angular speeds be The relevant angular momentum of S) is then 
Li = Iuii, and it points horizontally. 26 Let the desired precession frequency be 12. 
Then the magnitude of dLi/dt is 7+12 = (7+',)12, and this points perpendicularly to 

L+ 

Consider the torque t % on S t , around its CM. Let’s first look at S\. The pole provides 
an upward force of Nmg (this force is what keeps all the gyroscopes up), so it provides 
a torque of Nmgl around the CM of S\. The downward force from the stick to the 
right provides no torque around the CM (because it acts at the CM). Therefore, 
r 1 = dLi/dt gives Nmgl = (7wi)12, and so 


U)i 


Nmgl 

712 


(8.147) 


26 We are ignoring the angular momentum arising from the precession. This part of L points 
vertically (because the gyroscopes all point horizontally) and therefore does not change. Hence, it 
does not enter into t = dL/dt. 
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Now look at S- 2 - Si provides an upward force of (N — 1 ) rrig (this force is what keeps 
S 2 through 5 jv up), so it provides a torque of (N — 1 )rrigl around the CM of S- 2 - The 
downward force from the stick to the right provides no torque around the CM of .S'2. 
Therefore, T2 = dL 2 /dt gives (N — 1 )mgt = (Iu> 2 )f2, and so 


U>2 


(N — 1) rrigl 

m 


(8.148) 


Similar reasoning applies to the other Si, and we arrive at 
_ (N + 1 — i)mg£ 

= m ‘ 

The u>i are therefore in the ratio 

uq : W 2 • ■ ■ ■ • &N—1 ■ &N = N : (TV — 1) : • • • : 2 : 1. 


(8.149) 


(8.150) 


Note that we needed to apply r = dL/dt many times, using each CM as an origin. 
Using only the pivot point on the pole as the origin would have given only one piece 
of information, whereas we needed N pieces. 


Remarks: As a double-check, we can verify that these a/s make t = dL/dt true, where t 
and L are the total torque and angular momentum relative to the pivot on the pole. (Using 
the CM of the entire system as the origin would give the same equation.) The CM of the 
entire system is (N + l)£/2 from the wall, so the torque due to gravity is 




The total angular momentum is, using eq. (8.149), 


(8.151) 


= "If ( V -(- V - l) + (V-2)-2flj 

mgi N(N + 1) 

= n-2—• (8 ' 152) 

So indeed, r = LQ = |dL/dt|. 

We can also pose this problem for the setup where all the uj t are equal (call them oj), and 
the goal is the find the lengths of the sticks that will allow the desired motion. We can use 
the same reasoning as above, and eq. (8.149) takes the modified form 


(N + 1 — i)mgli 


where U is the length of the ith stick. Therefore, the L are in the ratio 


(8.153) 


ii: *2 :■■■■■ Cv , : Cv - \. : y ' , : - • : \ : 1. (8.154) 

Note that since the sum 1/n diverges, it is possible to make the setup extend arbitrarily 
far from the pole. 

Again, we can verify that these £’s make t = dL/dt true, where r and L are the total torque 
and angular momentum relative to the pivot on the pole. As an exercise, you can show that 
the CM happens to be a distance £n from the pole. So the torque due to gravity is, using 
eq. (8.153) to obtain Cv, 


t = Nmg(.N = Nmg{u>ILl/mg) = NIuiQ.. (8.155) 


The total angular momentum is simply L = NIu). So indeed, r = LLl = |<JL/dt|. A 
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18. Heavy top on slippery table 

In Section 8.7.3, we looked at r and L relative to the pivot point. Such quantities are 
of no use here, because we can’t use r = dL/dt relative to the pivot point (because 
it is accelerating). We will therefore look at r and L relative to the CM, which is 
always a legal origin around which we can apply t = dL/dt. 

The only force the floor applies is the normal force Mg. Therefore, the torque relative 
to the CM has the same magnitude, Mgl sin 9, and the same direction as in Section 
8.7.3. If we choose the CM as the origin of our coordinate system, then all the Euler 
angles are the same as before. The only change in the whole analysis is the change 
in the 1% = 1% = I moment of inertia. We are now measuring these moments with 
respect to the CM, instead of the pivot point. By the parallel axis theorem, they are 
now equal to 

I' = I — M£ 2 . (8.156) 

Therefore, changing I to I — Ml 2 is the only modification needed. 

19. Fixed highest point 

For the desired motion, the important thing to note is that every point in the top 
moves in a fixed circle around the z-axis. Therefore, w points vertically. Hence, if 12 
is the frequency of precession, we have w = 12z. 

(Another way to see that u> points vertically is to view things in the frame that rotates 
with angular velocity 12z. In this frame, the top has no motion whatsoever. It is not 
even spinning, because the point P is always the highest point. In the language of 
Fig. 8.27, we therefore have u' = 0. Hence, w = 12z + u/x 3 = 12z.) 

The principal moments are (with the pivot as the origin; see Fig. 8.71) 


h 


MR? 

ft ’ 


I = I x = I 2 = Mt + - 


(8.157) 


where we have used the parallel-axis theorem to obtain the latter. The components 
of u along the principal axes are w 3 = 12 cos 9, and u>2 = 12 sin 9. Therefore (keeping 
things in terms of the general moments, 7 3 and I), 


L = 7 3 12 cos 0x 3 + 712 sin 0x 2 . 


(8.158) 


The horizontal component of L is then L± = (7 3 12 cos 0) sin 6 — (712 sin 9) cos 9, and 
so dL/dt has magnitude 


I I 
I dt I 


L_ l12 = 12 2 i 




(8.159) 


and is directed into the page (or out of the page, if this quantity is negative). This 
must equal the torque, which has magnitude |r| = Mgl sin#, and is directed into the 
page. Therefore, 


12 = 


Mgl 

(. I 3 — 7) cos 9 ’ 


(8.160) 


We see that for a general symmetric top, such precessional motion (where the same 
“side” always points up) is possible only if 


h > 7 . 


( 8 . 161 ) 
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Note that this condition is independent of 0. For the problem at hand, i 3 and I are 
given in eq. (8.157), and we find 


n = \Z (fi 2 -4P)cosfl ’ < 8 ' 162 > 

and the necessary condition for such motion is R > 21. 

Remarks: 

(a) It is intuitively clear that $7 should become very large as 6 —> 7r/2, although it is by 
no means intuitively clear that such motion should exist at all for angles near 7r/2. 

(b) f2 approaches a non-zero constant as 0 —> 0, which isn’t entirely obvious. 

(c) If both R and i are scaled up by the same factor, then $7 decreases. (This also follows 
from dimensional analysis.) 

(d) The condition I 3 > I can be understood in the following way. If I 3 = I, then Locu, 
and so L points vertically along uj. If I 3 > 7, then L points somewhere to the right of 
the z-axis (at the instant shown in Fig. 8.71). This means that the tip of L is moving 
into the page, along with the top. This is what we need, because t points into the 
page. If, however, I 3 < I, then L points somewhere to the left of the z-axis, so dL/dt 
points out of the page, and hence cannot be equal to r. 4k 

20. Basketball on rim 

Consider the setup in the frame rotating with angular velocity Uz. In this frame, 
the center of the ball is at rest. Therefore, if the contact points are to form a great 
circle, the ball must be spinning around the (negative) x 3 axis shown in Fig. 8.72. 
Let the frequency of this spinning be u/ (in the language of Fig. 8.27). Then the 
nonslipping condition says that u>'r = UR, and so u' = UR/r. Therefore, the total 
angular velocity vector of the ball in the lab frame is 

u) = Uz — u/x 3 = Uz — (7?/r)f7x 3 . (8.163) 



Figure 8.72 


Let us choose the center of the ball as the origin around which r and L are calculated. 
Then every axis in the ball is a principal axis, with moment of inertia I = (2/3 )mr 2 . 
The angular momentum is therefore 

L = Iw = IUz - I(R/r)Ux 3 . (8.164) 

Only the x 3 piece has a horizontal component which will contribute to dL/dt. This 
component has length L± = I{R./r)ii sin 6 . Therefore, dL/dt has magnitude 

= UL ± = -U 2 mrRsin9, (8.165) 

| at | 3 


and points out of the page. 

The torque (relative to the center of the ball) comes from the force at the contact 
point. There are two components of this force. The vertical component is mg, and 
the horizontal component is m(R—rcos6)U 2 (pointing to the left), because the center 
of the ball moves in a circle of radius (R — r cos 9). We then find the torque to have 
magnitude 


|r| = mg(r cos 6) — m(R — r cos 6)U 2 (r sin 9), 


(8.166) 
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Figure 8.73 


with outward from the page taken to be positive. Using the previous two equations, 
t = dL/dt gives 


12 2 


geos 9 

sin#(|i? — rcos 6 ) 


(8.167) 


Remarks: 


(a) 12 —> oo as 9 —> 0, which makes sense. 

(b) Also, 12 —> oo when R = (3/5)rcos0. This case, however, is not physical, because we 
must have R> r cos 9 in order for the other side of the rim to be outside the basketball. 

(c) You can also work out the problem for the case where the contact points trace out a 
circle other than a great circle (say, one that makes an angle (3 with respect to the 
great circle). The expression for the torque in eq. (8.166) is unchanged, but the value 
of to' and the angle of the X3-axis both change. Eq. (8.165) is therefore modified. The 
resulting 12, however, is a bit messy. $ 

21. Rolling lollipop 


(a) We claim that u> points horizontally to the right (at the instant shown in 
Fig. 8.73), with magnitude (R/r)U. This can be seen in (at least) two ways. 
The first method is to note that we essentially have the same scenario as in 
the “Rolling cone” setup of Problem 3. The sphere’s contact point with the 
ground is at rest (the non-slipping condition), so to must pass through this 
point (horizontally). The center of the sphere moves with speed 12 R. But since 
the center may also be considered to be instantaneously moving with frequency 
to in a circle of radius r around the horizontal axis, we have tor = UR. Therefore, 

w = (fl/r)n. 

The second method is to write u> as to = —12z + u/x 3 (in the language of Fig. 
8.27), where to' is the frequency of the spinning as viewed by someone rotating 
around the (negative) z axis with frequency 12. The contact points form a circle 
of radius R on the ground. But they also form a circle of radius r cos 6 on the 
sphere (where 6 is the angle between the stick and the ground). The non-slipping 
condition then implies UR = to'(rcos9). Therefore, to’ = UR/(rcos9), and 

to = —12z + u/x 3 = —12z + () (cos#x + sin#z) = (R/r) 12x, (8.168) 

\ r cos 9 J 

where we have used tan# = r/R. 

(b) Choose the pivot as the origin. The principal axes are then x 3 along the stick, 
along with any two directions orthogonal to the stick. Choose x 2 to be in the 
plane of the paper. Then the components of to along the principal axes are 

U 3 = (R/r)U cos 9, and to 2 =—(R/r)Usm9. (8.169) 

The principal moments are 

I 3 = (2/5)mr 2 , and I 2 = (2/5)mr 2 + m(r 2 + R 2 ), (8.170) 

where we have used the parallel-axis theorem. The angular momentum is L = 
/ 3 a; 3 x 3 + /2CJ2X2, so its horizontal component has length L± — / 3 u; 3 cos# — 
I 2 to 2 sin 9. Therefore, the magnitude of dL/dt is 
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12L L 

fi(/3W3 cos 8 — I 2 U 2 sin 8 ) 

12 ^(g TOr2 ) (—12 cos cos8 

-(Jmr 2 + m(r 2 + .R 2 ))(- ^12 sin#) sin 6^ 

12 2 to— ^jjr 2 + ( r2 + -R 2 ) sin 2 

7 -mrRfl 2 , (8.171) 

where we have used sin 0 = r/s/r 2 + R 2 . The direction of c/L /dt is out of the 
page. 

Remark: There is actually a quicker way to calculate dL/dt. At a given instant, 
the sphere is rotating around the horizontal :r-axis with frequency ui = (R/r)Q. The 
moment of inertia around this axis is I x = (7/5 )mr 2 , from the parallel-axis theorem. 
Therefore, the horizontal component of L has magnitude 

L x = I x uj = 7 -mrRQ.. (8.172) 

Multiplying this by frequency (namely 12) at which L swings around the 2-axis gives 
the result for |dL/dt| in eq. (8.171). Note that there is also a vertical component of 
L relative to the pivot, 27 but this component doesn’t change, so it doesn’t come into 
dL/dt. 4 

The torque (relative to the pivot) is due to the gravitational force acting at 
the CM, along with the normal force, N, acting at the contact point. (Any 
horizontal friction at the contact point will yield zero torque relative to the 
pivot.) Therefore, r points out of the page with magnitude |r| = (N — mg)R. 
Equating this with the \dL/dt\ in eq. (8.171) gives 

N = mg + ^mrf2 2 . (8.173) 

This has the interesting property of being independent of R (and hence 8 ). 

Remark: The pivot must provide a downward force of N — mg = (7/5)mrf2 2 , to 
make the net vertical force on the lollipop equal to zero. This result is slightly larger 
than the mr O 2 result for the “sliding” situation in Exercise 6. 

The sum of the horizontal forces at the pivot and the contact point must equal the 
required centripetal force of mRQ. 2 . But it is impossible to say how this force is divided 
up, without being given more information. $ 

22. Rolling coin 

Choose the CM as the origin. The principal axes are then £2 and X3 (as shown in 
Fig. 8.74), along with xi pointing into the paper. Let 12 be the desired frequency. 

27 This vertical component equals —mR 2 12, which makes intuitive sense. It can be obtained via 
the inertia tensor relative to the pivot, or by calculating L y = I 3 UJ 3 sin# + I 2 U 2 cos 8 . 


dL 

dt 
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Look at things in the frame rotating around the z-axis with frequency II. In this 
frame, the CM remains fixed, and the coin rotates with frequency u>' (in the language 
of Fig. 8.27) around the (negative) X3-axis. The non-slipping condition says that 
ijj'r = UR, and so u/ = IlR/r. Therefore, the total angular velocity vector of the coin 
in the lab frame is 

u) = flz — a/x3 = flz — —flx 3 . (8.174) 

But z = sin 0x 2 + cos 0X3, so we may write w in terms of the principal axes as 

a> = flsin0x 2 — U — cos0^ X3. (8.175) 

The principal moments are 

I 3 = (1/2 )mr 2 , and I 2 = (1/4 )mr 2 . (8.176) 


The angular momentum is L = / 2 u> 2 x 2 + /3W3X3, so its horizontal component has 
length L± = J 2 in 2 cos 9 — I 3 U 3 sin 9, with leftward taken to be positive. Therefore, the 
magnitude of dL/dt is 

= UL _ L 

= fl (/ 2 w 2 cos 9 — I 3 U 3 sin 9) 

= Si((lmr 2 )( U sin o'j cos 9 — (^ rnr 2 ) (- m/ r — cos 6)^ sin^ 

= -mrU 2 sin 9(2R — r cos 9), (8.177) 

with a positive quantity corresponding to dL/dt pointing out of the page (at the 
instant shown). 

The torque (relative to the CM) comes from the force at the contact point. There 
are two components of this force. The vertical component is mg, and the horizontal 
component is m(R — r cos 9)Il 2 (pointing to the left), because the CM moves in a 
circle of radius (R — r cos 9) . We then find the torque to have magnitude 

|r| = mg(r cos 9) — m(R — rcos0)fl 2 (rsin0), (8.178) 


I dL I 

I dt I 


with outward from the page taken to be positive. Equating this |r| with the \dL/dt\ 
from eq. (8.177) gives 


U 2 


_ 9 _ 

§i?tan0 — |r sin0 


(8.179) 


The right-hand side must be positive if a solution for fl is to exist. Therefore, the 
condition for the desired motion to be possible is 


R> 


5 

6 r 


COS0. 


(8.180) 


Remarks: 

(a) For 0 —> rr/2, eq. (8.179) gives O —> 0, as it should. And for 0 —> 0, we obtain Q —> 00 , 
which also makes sense. 
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(b) Note that for (5/6)rcos0 < R < rcosO, the CM of the coin lies to the left of the 
center of the contact-point circle (at the instant shown). The centripetal force, m(R — 
rcos0)fl 2 , is therefore negative (which means that it is directed radially outward, 
to the right), but the motion is still possible. As R gets close to (5/6)rcos$, the 
frequency 12 goes to infinity, which means that the radially outward force also goes to 
infinity. The coefficient of friction between the coin and the ground must therefore be 
correspondingly large. 

(c) We may consider a more general coin, whose density depends on only the distance from 
the center, and which has I 3 = rymr 2 . (For example, a uniform coin has r) = 1/2, and 
a coin with all its mass on the edge has rj = 1.) By the perpendicular axis theorem, 
h = I 2 = (l/ 2 )rjmr 2 , and you can show that the above methods yield 

° = (1 + r,)R tan 9 - (l+??/2)r sintf ' (8 ' 181 ^ 

The condition for such motion to exist is then 

R> ('j^) rCOs6 - * ( 8 - 182 ) 


23. Wobbling coin 

(a) Look at the setup in the frame rotating with angular velocity flz. In this frame, 
the location of the contact point remains fixed, and the coin rotates with fre¬ 
quency lu' (in the language of Fig. 8.27) around the negative £3 axis. The 
radius of the circle of contact points on the table is R cos 9. Therefore, the non¬ 
slipping condition says that u>'R = Cl(R cos 0), and so u>' = ficos 9. Hence, the 
total angular velocity vector of the coin in the lab frame is 

u) = flz — a/x3 = fl(sin#X2 + cos 0x3) — (f2cos0)x3 = ft sin 0X2- (8.183) 

In retrospect, it makes sense that 00 must point in the X2 direction. Both the 
CM and the instantaneous contact point on the coin are at rest, so w must lie 
along the line containing these two points (that is, along the X2-axis). 

(b) Choose the CM as the origin. The principal moment around the X2-axis is 
I = mR 2 /4. The angular momentum is L = /0^2X2, so its horizontal component 
has length L± = L cos 9 = (Iuz) cos 9. Therefore, dL/dtt has magnitude 

= QL ± =n(^^j (Qsin0)cos0, (8.184) 

and it points out of the page. 

The torque (relative to the CM) is due to the normal force at the contact point 
(there is no sideways friction force at the contact point, because the CM is 
motionless), so it has magnitude 


dL 

dt 


|r| = mgRcos9, 


(8.185) 


and it also points out of the page. Using the previous two equations, t = dL/dt 
gives 


Q = 2, 


(8.186) 
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Remarks: 

i. Q —> oo as 9 — > 0. This is quite evident if you do the experiment; the contact 
point travels very quickly around the circle. 

ii. Q —> 2\fgjR as 9 —> tx/2. This isn’t so intuitive (to me, at least). In this case, L 
points nearly vertically, and it traces out a tiny cone, due to a tiny torque. 

iii. In this limit 9 — > 7t/2, O is also the frequency at which the plane of the coin spins 
around the vertical axis. If you spin a coin very fast about a vertical diameter, 
it will initially undergo a pure spinning motion with only one contact point. It 
will then gra duall y lose energy due to friction, until the spinning frequency slows 
down to 2iy/ g/R, at which time it will begin to wobble. (We’re assuming, of 
course, that the coin is very thin, so that it can’t balance on its edge.) In the case 
where the coin is a quarter (with R « .012m), this critical frequency of 2^/g/R 
turns out to be Q ss 57rad/s, which corresponds to about 9 Hertz. 

iv. The result in eq. (8.186) is a special case of the result in eq. (8.179) of Problem 22. 
The CM of the coin in Problem 22 will be motionless if R = r cos 0. Plugging this 
into eq. (8.179) gives fl 2 = 4g/(rsin9), which agrees with eq. (8.186), because r 
was the coin’s radius in Problem 22. £ 

(c) Consider one revolution of the point of contact around the z-axis. Since the 
radius of the circle on the table is R cos 9, the contact point moves a distance 
2ttR cos 6 around the coin during this time. Hence, the new point of contact on 
the coin is a distance 2-rrR — 2nR. cos 6 away from the original point of contact. 
The coin therefore appears to have rotated by a fraction (1 — cos 9) of a full turn 
during this time. The frequency with which you see it turn is therefore 


(1 - cos#)fl = 


2(1- cos 9) IJ 

\J sin 9 V R 


(8.187) 


Remarks: 

i. If 9 « 7t/2, then the frequency of Abe’s rotation is essentially equal to f2. This 
makes sense, because the top of Abe’s head will be, say, always near the top of 
the coin, and this point will trace out a small circle around the z-axis, with nearly 
the same frequency as the contact point. 

ii. As 9 —> 0, Abe appears to rotate with frequency 6'^ 2 \Jg/R (using sin 9 « 6 and 
cos# ~1- 0 2 /2). Therefore, although the contact point moves infinitely quickly 
in this limit, we nevertheless see Abe rotating infinitely slowly. 

iii. All o f the results for frequencies in this problem have to look like some multiple 
of g/R, by dimensional analysis. But whether the multiplication factor is zero, 
infinite, or something in between, is not at all obvious. 

iv. An incorrect answer for the frequency of Abe’s turning (when viewed from above) 
is that it equals the vertical component of w, which is u> z = u sin 9 = (fi sin 6 ) sin 9 = 
2(sin 9) 3 Py/g/R. This does not equal the result in eq. (8.187). (It agrees at 
9 = 7t/2, but is off by a factor of 2 for 9 —> 0.) This answer is incorrect because 
there is simply no reason why the vertical component of w should equal the fre¬ 
quency of revolution of, say, Abe’s nose, around the vertical axis. For example, at 
moments when w passes through the nose, then the nose isn’t moving at all, so it 
certainly cannot be described as moving around the vertical axis with frequency 

The result in eq. (8.187) is a sort of average measure of the frequency of rotation. 
Even though any given point on the coin is not undergoing uniform circular 
motion, your eye will see the whole coin (approximately) rotating uniformly. £ 
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24. Nutation cusps 

(a) Because both ^ and 9 are continuous functions of time, we must have (j> = 9 = 0 
at a kink. (Otherwise, either dO/dcj) = 9/(j> or dcp/dO = (f>/9 would be well-defined 
at the kink.) Let the kink occur at t = to- Then the second of eqs. (8.89) gives 
sin(o;„to) = 0. Therefore, cos(uj n to) — ±1, and the first of eqs. (8.89) gives 

AQ = tU s , (8.188) 


as we wanted to show. 


Remark: Note that if cos(w„fo) ; — 1, then AQ = — Q s , so eq. (8.88) says that the 
kink occurs at the smallest value of 6 , that is, at the highest point of the top’s motion. 
And if cos(u) n to) = —1, then AQ = Q s , so eq. (8.88) again says that the kink occurs 
at the highest point of the top’s motion, ft 


(b) 


To show that these kinks are cusps, we will show that the slope of the 9 vs. <j) 
plot is infinite on either side of the kink. That is, we will show that d6/d(f> = 
9/cj) = ±oo. For simplicity, we will look at the case where cos(uj n to) = 1 and 
AQ = —Q s . (The cos(u>„fo) = —1 case proceeds the same.) With AQ = —f l s , 
eqs. (8.89) give 


9 sin^osiniUnt 
1 — cos U) n t 


(8.189) 


Letting t = t 0 + e, we have (using cos(w n f 0 ) = 1 and sin(u;„t 0 ) = 0, and expand¬ 
ing to lowest order in e) 


9 sin 9§w n e 2 sin 9q 

<4e 2 /2 

For infinitesimal e, this switches from — oo to ±oo as e 

25. Nutation circles 


(8.190) 

s through zero. 


(a) Note that a change in angular speed of ACl around the fixed z-axis corresponds 
to a change in angular speed of sin d 0 AQ around the X2-axis. The kick therefore 
produces an angular momentum component (relative to the pivot) of I sin0 o A^ 
in the X2 direction. 

The original L pointed along the x 3 direction, with magnitude I3003. (These two 
statements hold to a good approximation if oj :i ±> fi g .) By definition, L made 
an angle 9q with the vertical z-axis. Therefore, from Fig. 8.75, the angle that 
the new L makes with the z-axis is (using AQ <c 0J3) 


I sin 9o AQ 
I30J3 


sinflpAQ 


(8.191) 


where we have used the definition of u) n from eq. (8.83). We see that the effect 
of the kick is to make L quickly change its 9 value. (It changes by only a 
small amount, because we are assuming u>„ ~ » AQ). The <p value doesn’t 

immediately change. 

(b) The torque (relative to the pivot) has magnitude Mg£ sin 9 and is directed hor¬ 
izontally. Because 9 doesn’t change appreciably, the magnitude of the torque is 
essentially constant, so L traces out a circle at a constant rate. This rate is sim¬ 
ply Q s . (None of the relevant quantities in r = dL/dt changed much from the 
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original circular-precession case, so the precession frequency remains basically 
the same.) Therefore, the new L has its (0, 6 ) coordinates given by 

m), 0(*))l = (n a t, e 0 - . (8.192) 

Looking at eqs. (8.88), we see that the coordinates of the CM relative to L are 

(<j>(t), 6 (t)) cm-l = ^ sinwjjf, sin0 o ^ cosw„f^ . (8.193) 

The sin 0 O factor in 6 (t) is exactly what is needed for the CM to trace out a 
circle around L, because a change in 0 corresponds to a CM spatial change of 
fA0sin0 o , whereas a change in 9 corresponds to a CM spatial change of (AO. 
This circular motion is exactly what we expect from the results in Section 8.6.2, 
for the following reason. For u) n very large and U s very small, L is essentially 
motionless, and the CM traces out a circle around it at frequency u> n . Since L is 
essentially constant, the top should therefore behave very much like a free top, 
as viewed from a fixed frame. 

Eq. (8.53) in Section 8.6.2 says that the frequency of the precession of £3 around 
L for a free top is L/I. But the frequency of the precession of £3 around L in 
the present problem is ui n , so this had better be equal to L/I. And indeed, L is 
essentially equal to I3U3, so L/I = I3U3/I = u> n . 

Therefore, for short enough time scales (short enough so that L doesn’t move 
much), a nutating top with UJ3 AQ Q s looks very much like a free top. 

Remark: We need the Af2 » fi, condition so that the nutation motion looks like 
circles. This can be seen by the following reasoning. The time for one period of the 
nutation motion is 2-K/uj n . From eq. (8.88), (f>(t) increases by A0 = 2ttQ s /u>„ in this 
time. And also from eq. (8.88), the width, w, of the “circle” along the 0 axis is 
roughly w = 2A Q./uj n . The motion looks like basically like a circle if w A0, that is, 
if AO>f2 s . * 

26. Rolling straight? 

Intuitively, it is fairly clear that the sphere cannot change direction. But it is a little 
tricky to prove. Qualitatively, we can reason as follows. Assume there is a nonzero 
friction force at the contact point. (The normal force is irrelevant here, because it 
doesn’t provide a torque about the CM. This is what is special about a sphere.) 
Then the ball will accelerate in the direction of this force. However, you can show 
with the right-hand rule that this force will produce a torque that will cause the 
angular momentum to change in a way that corresponds to the ball accelerating in 
the direction opposite to the friction force. There is thus a contradiction, unless the 
friction force equals zero. 

Let’s now be rigorous. Let the angular velocity of the ball be 00 . The non-slipping 
condition says that the ball’s velocity equals 


v = ux(4 (8.194) 

where a is the radius of the sphere. The ball’s angular momentum is 


L = Iu. 


(8.195) 
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The friction force from the ground (if it exists) at the contact point will change both 
the momentum and the angular momentum of the ball. F = dp/dt gives 



and r = dL/dt (relative to the center of the ball) gives 

| ~az) xF = ^, (8.197) 

because the force is applied at position — az relative to the ball’s center. 

We will now show that the preceding four equations imply Co = 0, that is, v is 
constant. Use the v from eq. (8.194) in eq. (8.196), and then plug the resulting F 
into eq. (8.197). Also, use the L from eq. (8.195) in eq. (8.197). The result is 

(—az) x (moo x (az)) = loo. (8.198) 

Because the vector Co lies in the horizontal plane, you can work out that zx(uxz) = Co. 
Therefore, we have 

2 

—ma 2 Co = -ma 2 Co, (8.199) 

and hence Co 0, as we wanted to show. 

27. Ball on paper 

Our strategy will be to produce (and equate) two different expressions for the total 
change in the angular momentum of the ball (relative to its center). The first comes 
from the effects of the friction force on the ball. The second comes from looking at 
the initial and final motion. 

To produce our first expression for AL, note that the normal force provides no torque, 
so we may ignore it. The friction force, F, from the paper changes both p and L, 
according to, 

Ap = J Fdt, 

AL = J rdt = J(-Rz)xFdt = (-Rz) x j F dt. (8.200) 

Both of these integrals run over the entire slipping time, which may include time on 
the table after the ball leaves the paper. In the second line above, we have used the 
fact that the friction force always acts at the same location, namely (—Rz), relative 
to the center of the ball. The two above equations yield 

AL = (-Rz) x Ap. (8.201) 

To produce our second equation for AL, let’s examine how L is related to p when 
the ball is rolling without slipping, which is the case at both the start and the finish. 
When the ball is not slipping, we have the situation show in Fig. 8.76 (assuming that 
the ball is rolling to the right). The magnitudes of p and L are given by 


P = ™, 

L = loo = ^mR 2 oo = ^Rm(Rio) = ^Rmv = g-Rp, (8.202) 



Figure 8.76 
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where we have used the non-slipping condition, v = Rlu. (The actual I = (2/5) mi? 2 
value for a solid sphere will not be important for the final result.) We now note that 
the directions of L and p can be combined with the above L = 2Rp/5 scalar relation 
to give 

L = giZz x p, (8.203) 

where z points out of the page. Since this relation is true at both the start and the 
finish, it must also be true for the differences in L and p. That is, 

2 

AL = -Rz x Ap. (8.204) 

Eqs. (8.201) and (8.204) give 

2 

{—Rz) x Ap = -Rz x Ap 

=> 0 = z x Ap. (8.205) 

There are three ways this cross product can be zero: 

• Ap is parallel to z. But it isn’t, because Ap lies in the horizontal plane. 

• z = 0. Not true. 

• Ap = 0. This one must be true. Therefore, Av = 0, as we wanted to show. 
Remarks: 

(a) As stated in the problem, it’s fine if you move the paper in a jerky motion, so that the 
ball slips around on it. We assumed nothing about the nature of the friction force in 
the above reasoning. And we used the non-slipping condition only at the initial and 
final times. The intermediate motion is arbitrary. 

(b) As a special case, if you start a ball at rest on a piece of paper, then no matter how 
you choose to (horizontally) slide the paper out from underneath the ball, the ball will 
be at rest in the end. 

(c) You are encouraged to experimentally verify that all these crazy claims are true. Make 
sure that the paper doesn’t wrinkle (this would allow a force to be applied at a point 
other than the contact point). And balls that don’t squish are much better, of course 
(for the same reason). & 

28. Ball on turntable 

Let the angular velocity of the turntable be flz, and let the angular velocity of the 
ball be u>. If the ball is at position r (with respect to the lab frame), then its velocity 
(with respect to the lab frame) may be broken up into the velocity of the turntable 
(at position r) plus the ball’s velocity with respect to the turntable. The non-slipping 
condition says that this latter velocity is given by u> x (az), where a is the radius of 
the ball. The ball’s velocity with respect to the lab frame is therefore 

v = (flz) x r + w x (az). (8.206) 

The important point to realize in this problem is that the friction force from the 
turntable is responsible for changing both the ball’s linear momentum and its angular 
momentum. In particular, F = dp/dt gives 


dv 

dt 


(8.207) 
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And the angular momentum of the ball is L = Iu>, so r = dL/dt (relative to the 
center of the ball) gives 


because the force is applied at position — az relative to the center. 

We will now use the previous three equations to demonstrate that the ball undergoes 
circular motion. Our goal will be to produce an equation of the form, 


since this describes circular motion, with frequency ft' (to be determined). Plugging 
the expression for F from eq. (8.207) into eq. (8.208) gives 


Taking the derivative of eq. (8.206) giv 


Since the vector dv/dt lies in the horizontal plane, it is easy to work out the cross- 
product in the right term (or just use the identity (A x B) x C = (A - C)B — (B • C)A) 


For a uniform sphere, I = (2/5) m 


Therefore, in view of eq. (8.209), we see that the ball undergoes circular motion, with 
a frequency equal to 2/7 times the frequency of the turntable. This result for the 
frequency does not depend on initial conditions. 


(a) Integrating eq. (8.213) from the initial t 


— v 0 — (^Ojzx (r-r 0 ). 
m show) in the more suggestive form, 


(8.215) 
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This equation describes circular motion, with the center located at the point, 

rc = r ° + l 2 ( z xvo) , (8.216) 

and with radius, 

fi = |r °- rc| = ^ xvo| = §r (8 - 217) 

(b) There are a few special cases to consider: 

• If Vo = 0 (that is, if the spinning motion of the ball exactly cancels the rotational 
motion of the turntable), then R = 0 and the ball remains in the same place (of 
course). 

• If the ball is initially not spinning, and just moving along with the turntable, 
then vo = firo- The radius of the circle is therefore R = (7/2)ro, and its center is 
located at (from eq. (8.216)) 

rc = ro + i ( ~ nro) = ~ir- (8 - 218) 

The point on the circle diametrically opposite to the initial point is therefore at 
a radius r c + R = (5/2)r 0 + (7/2)r 0 = 6r 0 . 

• If we want the center of the circle be the center of the turntable, then eq. (8.216) 
says that we need (7/2f2)z x vo = — ro. This implies that vo has magnitude 
v 0 = (2/7)f2ro and points tangentially in the same direction as the turntable 
moves. (That is, the ball moves at 2/7 times the velocity of the turntable beneath 
it.) 

(c) The fact that the frequency (2/7)fi is a rational multiple of O means that the ball will 
eventually return to the same point on the turntable. In the lab frame, the ball will 
trace out two circles in the time it takes the turntable to undergo seven revolutions. 
From the point of view of someone on the turntable, the ball will “spiral” around five 
times before returning to the original position. 

(d) If we look at a ball with moment of inertia I = r/ma 2 (so a uniform sphere has r) = 2/5), 
then you can show that the “2/7” in eq. (8.213) gets replaced by “q/(l +f?)”. If a ball 
has most of its mass concentrated at its center (so that r? —* 0), then the frequency of 
the circular motion goes to 0, and the radius goes to oo (as long as vo ^ 0). & 



8.10. SOLUTIONS 


VIII-71 



Chapter 9 


Accelerated Frames of 
Reference 

Copyright 2004 by David Morin, morin@physics.harvard.edu 


Newton’s laws hold only in inertial frames of reference. However, there are many 
non-inertial (that is, accelerated) frames of reference that we might reasonably want 
to study, such as elevators, merry-go-rounds, and so on. Is there any possible way 
to modify Newton’s laws so that they hold in non-inertial frames, or do we have to 
give up entirely on F = ma? 

It turns out that we can indeed hold onto our good friend F = ma, provided 
that we introduce some new “fictitious” forces. These are forces that a person in 
the accelerated frame thinks exist. If she applies F = ma while including these new 
forces, then she will get the correct answer for the acceleration, a, as measured with 
respect to her frame. 

To be quantitative about all this, we’ll have to spend some time determining 
how the coordinates (and their derivatives) in an accelerated frame relate to those 
in an inertial frame. But before diving into that, let’s look at a simple example 
which demonstrates the basic idea of fictitious forces. 


Example (The train): Imagine that you are standing on a train that is accelerating 
to the right with acceleration a. If you wish to remain at the same spot on the 
train, there must be a friction force between the floor and your feet, with magnitude 
Ff = ma, pointing to the right. Someone standing in the inertial frame of the ground 
will simply interpret the situation as, “The friction force, Ff = ma, causes your 
acceleration, a.” 

How do you interpret the situation, in the frame of the train? (Imagine that there 
are no windows, so that all you see is the inside of the train.) As we will show below 
in eq. (9.11), you will feel a fictitious translation force, F trans = —ma, pointing to 
the left. You will therefore interpret the situation as, “In my frame (the frame of the 
train), the friction force Ff = ma pointing to my right exactly cancels the mysterious 
Ftrans = —ma force pointing to my left, resulting in zero acceleration (in my frame).” 
Of course, if the floor of the train is frictionless so that there is no force at your feet, 
then you will say that the net force on you is F trans = —ma, pointing to the left. You 
will therefore accelerate with acceleration a to the left, with respect to your frame 
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(the train). In other words, you will remain motionless with respect to the inertial 
frame of the ground, which is all quite obvious to someone standing on the ground. 
In the case where the friction force at your feet is nonzero, but not large enough 
to balance out the whole F trans = —ma force, you will end up being jerked toward 
the back of the train. This undesired motion will continue until you make some 
adjustments with your feet or hands in order to balance out all of the F trans force. 
Such adjustments are generally necessary on a subway train, at least here in Boston, 
where hands are often crucial. 


Let’s now derive the fictitious forces in their full generality. The main task here 
is to relate the coordinates in an accelerated frame to those in an inertial frame, so 
this endeavor will require a bit of math. 


9.1 Relating the coordinates 

Consider an inertial coordinate system with axes x r , y : , and z, , and let there be 
another (possibly accelerating) coordinate system with axes x, y, and z. These axes 
will be allowed to change in an arbitrary manner with respect to the inertial frame. 
That is, the origin may undergo acceleration, and the axes may rotate. (This is 
the most general possible motion, as we saw in Section 8.1.) These axes may be 
considered to be functions of the inertial axes. Let Oi and O be the origins of the 
two coordinate systems. 

Let the vector from 0\ to O be R. Let the vector from 0\ to a given particle be 
iq. And let the vector from O to the particle be r. (See Fig. 9.1 for the 2-D case.) 
Then 

r I = R + r. (9.1) 

These vectors have an existence that is independent of any specific coordinate sys¬ 
tem, but let us write them in terms of some definite coordinates. We may write 

R = X'ki + Yyi +Zi.i, 
r : = aqx: + y 1 jq + z t z,. 

r = rx + yy + zz. (9.2) 

For reasons that will become clear, we have chosen to write R and r, in terms of 
the inertial-frame coordinates, and r in terms of the accelerated-frame coordinates. 
If desired, eq. (9.1) may be written as 

ah*: + y 1 y : + z : = (Ax : + Yy 1 + Zz : ) + (rx + yy + zz). (9.3) 

Our goal is to take the second time derivative of eq. (9.1), and to interpret the 
result in an F = ma form. The second derivative of r : is simply the acceleration of 
the particle with respect to the inertial system, and so Newton’s second law tells us 
that F = mil. The second derivative of R is the acceleration of the origin of the 
moving system. The second derivative of r is the tricky part. Changes in r can come 
about in two ways. First, the coordinates ( x,y,z ) of r (which are measured with 
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respect to the moving axes) may change. And second, the axes x, y, z themselves 
may change. So even if ( x,y,z ) remain fixed, the r vector may still change. 1 Let’s 
be quantitative about this. 


Calculation of d 2 r/dt 2 

We should clarify our goal here. We would like to obtain d 2 r/dt 2 in terms of the 
coordinates in the moving frame, because we want to be able to work entirely in 
terms of the coordinates of the accelerated frame, so that a person in this frame can 
write down an F = ma. equation in terms of only her coordinates, without having 
to consider the underlying inertial frame at all. 2 

The following exercise in taking derivatives works for a general vector A = 
A x k + A y y + A z z in the moving frame (it’s not necessary that it be a position 
vector). So we’ll work with a general A and then set A = r when we’re done. 

To take d/dt of A = A x k + A y y + A z z, we can use the product rule to obtain 


dA 

dt 


fdA x 

V dt : 


dA v „ dA z A 

-Niry + Nir z ) 




(9.4) 


Yes, the product rule works with vectors too. We’re doing nothing more than 
expanding (A x + dA x )(k + dk) — A x k. etc., to first order. 

The first of the two terms in eq. (9.4) is simply the rate of change of A, as 
measured with respect to the moving frame. We will denote this quantity by 6A/8t. 

The second term arises because the coordinate axes are moving. In what manner 
are they moving? We have already extracted the motion of the origin of the moving 
system (by introducing the vector R), so the only thing left is a rotation about some 
axis u> through the origin (see Theorem 8.1). This axis may be changing in time, 
but at any instant a unique axis of rotation describes the system. The fact that the 
axis may change will be relevant in finding the second derivative of r, but not in 
finding the first derivative. 

We saw in Theorem 8.2 that a vector B of fixed length (the coordinate axes 
here do indeed have fixed length), rotating with angular velocity u> = u<2>, changes 
at a rate dH/dt = u: x B. In particular, dk/dt = u x x, etc. So in eq. (9.4), the 
A x (dk/dt) term, for example, equals A x {u> x x) = uj x (A x k). Adding on the y and 
z terms gives u> x {A x k + A y y + A z z) = u) x A. Therefore, eq. (9.4) yields 


dA SA 

- =-I- U) 

dt St 


A. 


(9.5) 


This agrees with the result obtained in Section 8.5, eq. (8.39). We’ve basically given 
the same proof here, but with a little more mathematical rigor. 

1 Remember, the r vector is not simply the ordered triplet ( x,y,z ). It is the whole expression, 
r = a:x + yy + zz,. So even if (a:, y, z) are fixed, meaning that r doesn’t change with respect to 
the moving system, r can still change with respect to the inertial system if the x, y, z axes are 
themselves moving. 

2 In terms of the inertial frame, d 2 r/dt 2 is simply d 2 (r r — R) /dt 2 , but this is not very enlightening 
by itself. 
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We still have to take one more time derivative. The time derivative of eq. (9.5) 
yields 


d 2 A 

dt 2 


d (8A\ 
dt\ St ) 


du 

+ —xA + w> 
dt 


dA 

dt 


(9.6) 


Applying eq. (9.5) to the first term (with SA/St in place of A), and plugging eq. 
(9.5) into the third term, gives 


d 2 A 
dt 2 


( S 2 A SA 

{w +a ’ x w 



A) + 2u> 


6 A 

' —-hwx(u) 


SA du 
St dt 


A ) 


(9.7) 


At this point, we will now set A = r, so we have 


d 2 r _ 5 2 r 


( 9 . 8 ) 


where v = Sr/St is the velocity of the particle, as measured with respect to the 
moving frame. 


9.2 The fictitious forces 


Prom eq. (9.1) we have 


d 2 r _ d 2 r 1 d 2 R 
dt 2 dt 2 dt 2 


(9.9) 


Let us equate this expression for d 2 r/dt 2 with the one in eq. (9.8), and then multiply 
through by the mass m of the particle. Recognizing that the m(d 2 r 1 /dt 2 ) term is 
simply the force F acting on the particle (F may be gravity, a normal force, friction, 
tension, etc.), we may write the result as 


S 2 r 


= F — m—rr ,— moo x (u x r) - 2mw x v — m—r- x r 
dt 2 dt 

— F + F^ranslation A F cen t r ifugal + F^oriolis T Fazjmuthal, (9.10) 


where the fictitious forces are defined as 



crR 

-mw x (u > 

—2mu: x v, 
du: 


—m 


dt 


(9.11) 


We have taken the liberty of calling these quantities “forces”, because the left- 
hand side of eq. (9.10) is simply m times the acceleration, as measured by someone 
in the accelerated frame. This person should therefore be able to interpret the 
right-hand side as some effective force. In other words, if a person in the accelerated 
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frame wishes to calculate ma acc = rn(S 2 r/St 2 ), she simply needs to take the true 
force F, and then add on all the other terms on the right-hand side, which she will 
then quite reasonably interpret as forces (in her frame). She will interpret eq. (9.10) 
as an F = ma statement in the form, 

m a acc = ^F acc . (9.12) 

Note that the extra terms in eq. (9.10) are not actual forces. The constituents of 
F are the only real forces in the problem. All we are saying is that if our friend in the 
moving frame assumes the extra terms are real forces, and if she then adds them to 
F, then she will get the correct answer for m(6 2 r/6t 2 ), the mass times acceleration 
in her frame. 

For example, consider a box (far away from other objects, in outer space) that 
accelerates at a rate of g = 10 m/s 2 in some direction. A person in the box will feel 
a fictitious force of Ft rans = mg down into the floor. For all she knows, she is in 
a box on the surface of the earth. If she performs various experiments under this 
assumption, the results will always be what she expects. The surprising fact that no 
local experiment can distinguish between the fictitious force in the accelerated box 
and the real gravitational force on the earth is what led Einstein to his Equivalence 
Principle and his theory of General Relativity (discussed in Chapter 13). These 
fictitious forces are more meaningful than you might think. 

As Einstein explored elevators, 

And studied the spinning ice-skaters, 

He eyed as suspicious, 

The forces, fictitious, 

Of gravity’s great imitators. 

Let’s now look at each of the fictitious forces in detail. The translational and 
centrifugal forces are fairly easy to understand. The Coriolis force is a little more 
difficult. And the azimuthal force can be easy or difficult, depending on how exactly 
u is changing (we’ll mainly deal with the easy case). 

9.2.1 Translation force: — md 2 H/dt 2 

This is the most intuitive of the fictitious forces. We’ve already discussed this force 
in the train example in the introduction to this chapter. If R is the position of the 
train, then F trans = —rnd 2 H/dt 2 is the fictitious force you feel in the accelerated 
frame. 

9.2.2 Centrifugal force: —mu x (a; x r) 

This force goes hand-in-hand with the mv 2 /r = rnru 2 centripetal acceleration as 
viewed by someone in an inertial frame. 
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Example 1 (Standing on a carousel): Consider a person standing motionless on 
a carousel. Let the carousel rotate in the x-y plane with angular velocity w =wz (see 
Fig. 9.2). What is the centrifugal force felt by a person standing at a distance r from 
the center? 

Figure 9.2 

Solution: u> x r has magnitude u>r and points in the tangential direction, in the 
direction of motion (wxris simply the velocity as viewed by someone on the ground, 
after all). Therefore, tow x (u> x r) has magnitude mru> 2 and points radially inward. 
Hence, the centrifugal force, —tow x (w x r), points radially outward with magnitude 
mruo 2 . 



i north 
co T S pole 



Figure 9.3 


-mfflx(fflxr) 



Figure 9.4 


Remark: If the person is not moving with respect to the carousel, and if w is constant, 
then the centrifugal force is the only non-zero fictitious force in eq. (9.10). Since the person 
is not accelerating in her rotating frame, the net force (as measured in her frame) must be 
zero. The forces in her frame are (1) gravity pulling downward, (2) a normal force pushing 
upward (which cancels the gravity), (3) a friction force pushing inward at her feet, and (4) 
the centrifugal force pulling outward. We conclude that the last two of these must cancel. 
That is, the friction force points inward with magnitude mrJ 2 . 

Of course, someone standing on the ground will observe only the first three of these forces, 
so the net force will not be zero. And indeed, there is a centripetal acceleration, v 2 /r = riv 2 , 
due to the friction force. To sum up: in the inertial frame, the friction force exists to provide 
an acceleration. In the rotating frame, the friction force exists to balance out the mysterious 
new centrifugal force, in order to yield zero acceleration. £ 


Example 2 (Effective gravity force, mg e fj ): Consider a person standing mo¬ 
tionless on the earth, at a polar angle 9. (See Fig. 9.3. The way we’ve defined 
it, 6 equals 7t/2 minus the latitude angle.) She will feel a force due to gravity, mg, 
directed toward the center of the earth. 3 But in her rotating frame, she will also feel 
a centrifugal force, directed away from the rotation axis. The sum of these two forces 
(that is, what she thinks is gravity) will not point radially, unless she is at the equator 
or at a pole. Let us denote the sum of these forces as m,g e ff. 

To calculate mg e ff, we must calculate —mu> x (wxr). The wxr part has magnitude 
Rio sin 9, where R is the radius of the earth, and it is directed tangentially along the 
latitude circle of radius RsinO. So —mu> x (w x r) points outward from the 2 -axis, 
with magnitude rriRio 2 sin 9, which is just what we expect for something traveling at 
frequency w in a circle of radius R sin 9. Therefore, the effective gravitational force, 

mge ff Em(g-wx(wxr)), (9.13) 

points slightly in the southerly direction (for someone in the northern hemisphere), 
as shown in Fig. 9.4. The magnitude of the correction term, mRu 2 sin 9, is small 
compared to g. Using ui ta 7.3 ■ 10 -5 s -1 (that is, one revolution per day, which is 27t 
radians per 86,400 seconds) and R ss 6.4 • 10 6 m, we find Rui ' 2 w .03 m/s 2 . Therefore, 
the correction to g is about 0.3% at the equator. But it is zero at the poles. Exercise 
1 and Problem 1 deal further with g eff . 

3 Note that we are using g to denote the acceleration due solely to the gravitational force. This 
isn’t the “g-value” that the person measures, as we will shortly see. 
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Remarks: In the construction of buildings, and in similar matters, it is of course g e ff, and 
not g, that determines the “upward” direction in which the building should point. The exact 
direction to the center of the earth is irrelevant. A plumb bob hanging from the top of a 
skyscraper touches exactly at the base. Both the bob and the building point in a direction 
slightly different from the radial, but no one cares. 

If you look in table and find that the acceleration due to gravity in New York is 9.803m/s 2 , 
remember that this is the g e g value and not the g value (which describes only the gravitational 
force, in our terminology). The way we’ve defined it, the g value is the acceleration with 
which things would fall if the earth kept its same shape but somehow stopped spinning. The 
exact value of g is therefore generally irrelevant. & 


9.2.3 Coriolis force: —2 mu x v 

While the centrifugal force is very intuitive concept (everyone has gone around a 
corner in a car), the same thing cannot be said about the Coriolis force. This force 
requires a non-zero velocity v relative to the accelerated frame, and people normally 
don’t move appreciably with respect to their car while rounding a corner. To get a 
feel for this force, let’s look at two special cases. 


Case 1 (Moving radially on a carousel): A carousel rotates with constant 
angular speed uj. Consider someone walking radially inward on the carousel, with 
speed v (relative to the carousel) at radius r (see Fig. 9.5). uj points out of the page, 
so the Coriolis force -2mw x v points tangentially in the direction of the motion of 
the carousel (that is, to the person’s right, in our scenario), with magnitude 

F c or = 2 mujv. (9-14) 

Let’s assume that the person counters this force with a tangential friction force of 
2 mujv (pointing to his left) at his feet, so that he continues to walk on the same radial 
line. 4 

Why does this Coriolis force (and hence the tangential friction force) exist? It exists 
so that the resultant friction force changes the angular momentum of the person 
(measured with respect to the lab frame) in the proper way. To see this, take d/dt 
of L = mr 2 uj, where to is the person’s angular speed with respect to the lab frame, 
which is also the carousel’s angular speed. Using dr/dt = —v, we have 

^ = —2 mrujv + mr 2 (du> / dt ). (9.15) 

But duj/dt = 0, because the person is staying on one radial line, and we’re assuming 
that the carousel is arranged to keep a constant uj. Eq. (9.15) then gives dL/dt = 
—2 mrujv. So the L of the person changes at a rate of —(2 mujv)r. This is simply the 
radius times the tangential friction force applied by the carousel. In other words, it 
is the torque applied to the person. 

4 There is also the centrifugal force, which is countered by a radial friction force at the person’s 
feet. This effect won’t be important here. 



Figure 9.5 
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Remark: What if the person does not apply a tangential friction force at his feet? Then 
the Coriolis force of 2 mwv produces a tangential acceleration of 2 uiv in his frame, and 
hence the lab frame also. This acceleration exists essentially to keep the angular momentum 
(measured with respect to the lab frame) of the person constant. (It is constant in this 
scenario, because there are no tangential forces in the lab frame.) To see that this tangential 
acceleration is consistent with conservation of angular momentum, set dL/dt = 0 in eq. 
(9.15) to obtain 2u>v = r(dw/dt). The right-hand side of this is by definition the tangential 
acceleration. Therefore, saying that L is conserved is the same as saying that 2 ujv is the 
tangential acceleration (for this situation where the inward radial speed is v). & 


Case 2 (Moving tangentially on a carousel): Now consider someone walking 
tangentially on a carousel, in the direction of the carousel’s motion, with speed v 
(relative to the carousel) at constant radius r (see Fig. 9.6). The Coriolis force 
—2 tow x v points radially outward with magnitude 2rnojv. Assume that the person 
applies the friction force necessary to continue moving at radius r. 

There is a simple way to see why this outward force of 2 muv exists. Let V = ur be 
the speed of a point on the carousel at radius r, as viewed by an outside observer. 
If the person moves tangentially (in the same direction as the spinning) with speed 
v relative to the carousel, then his speed as viewed by the outside observer is V + v. 
The outside observer therefore sees the person walking in a circle of radius r at speed 
V + v. The acceleration of the person in the ground frame is therefore (V + v)' 2 /r. 
This acceleration must be caused by an inward-pointing friction force at the person’s 
feet, so 

t i{V + v) 2 _ mV^_ 2 mVv mv 2 




(9.16) 


This friction force is of course the same in any frame. How, then, does our person 
on the carousel interpret the three pieces of the inward-pointing friction force in 
eq. (9.16)? The first term simply balances the outward centrifugal force due to the 
rotation of the frame, which he always feels. The third term is simply the inward 
force his feet must apply if he is to walk in a circle of radius r at speed v, which is 
exactly what he is doing in the rotating frame. The middle term is the additional 
inward friction force he must apply to balance the outward Coriolis force of 2 muv 
(using V = wr). 

Said in an equivalent way, the person on the carousel will write down an F = ma 
equation of the form (taking radially inward to be positive), 


n(Vf 


•nV 2 2 mVv 


F friction + F cent + F cor . 


(9.17) 


We see that the net force he feels is indeed equal to his ma (where a is measured with 
respect to his rotating frame). 


For cases in between the two special ones above, things aren’t so clear, but that’s 
the way it goes. Note that no matter what direction you move on a carousel, the 
Coriolis force always points in the same perpendicular direction relative to your 
motion. Whether it’s to the right or to the left depends on the direction of the 
rotation. But given u>, you’re stuck with the same relative direction of force. 
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On a merry-go-round in the night, 
Coriolis was shaken with fright. 

Despite how he walked, 

’Twas like he was stalked 

By some fiend always pushing him right. 

Let’s do some more examples... 


Example 1 (Dropped ball): A ball is dropped from a height h, at a polar angle 
9 (measured down from the north pole). How far to the east is the ball deflected, by 
the time it hits the ground? 

Solution: Note that the ball is indeed deflected to the east, independent of which 
hemisphere it is in. The angle between us and v is tt — 9, so the Coriolis force, 
—2 mus x v, is directed eastward with ma gnitu de 2 musv sin 9, where v = gt is the speed 
at time t (t runs from 0 to the usual a/2 h/g). 5 The eastward acceleration at time t 
is therefore 2usgt sin 9. Integrating this to obtain the eastward speed (with an initial 
eastward speed of 0) gives u eas t = usgt 2 sm9. Integrating once more to obtain the 
eastward deflection (with an initial eastward deflection of 0) gives d eas t = usgt 3 sin 9/3. 
Plugging in t= y/2 h/g gives 



This is valid up to second-order effects in the small dimensionless quantity us^fhfg. 
For an everyday value of h, this quantity is indeed small, since w ~ 7.3 • 10 5 s -1 . 


Example 2 (Foucault’s pendulum): This is the classic example of a consequence 
of the Coriolis force. It unequivocally shows that the earth rotates. The basic idea 
is that due to the rotation of the earth, the plane of a swinging pendulum rotates 
slowly, with a calculable frequency. 

In the special case where the pendulum is at one of the poles, this rotation is easy to 
understand. Consider the north pole. An external observer, hovering above the north 
pole and watching the earth rotate, sees the pendulum’s plane stay fixed (with respect 
to the distant stars) while the earth rotates counterclockwise beneath it. 6 Therefore, 
to an observer on the earth, the pendulum’s plane rotates clockwise (viewed from 
above). The frequency of this rotation is of course just the frequency of the earth’s 
rotation, so the earth-based observer sees the pendulum’s plane make one revolution 
each day. 

What if the pendulum is not at one of the poles? What is the frequency of the 
precession? Let the pendulum be located at a polar angle 9. We will work in the ap¬ 
proximation where the velocity of the pendulum bob is horizontal. This is essentially 

technically, v = gt isn’t quite correct. Due to the Coriolis force, the ball will pick up a small 
sideways velocity component (this is the point of the problem). This component will then produce 
a second-order Coriolis force that effects the vertical speed (see Exercise 2). We may, however, 
ignore this small effect in this problem. 

6 Assume that the pivot of the pendulum is a frictionless bearing, so that it can’t provide any 
torque to twist the pendulum’s plane. 
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true if the pendulum’s string is very long; the correction due to the rising and falling 
of the bob is negligible. The Coriolis force, —2 mu> x v, points in some complicated 
direction, but fortunately we are concerned only with the component that lies in the 
horizontal plane. The vertical component serves only to modify the apparent force of 
gravity and is therefore negligible. (Although the frequency of the pendulum depends 
on g, the resulting modification is very small.) 

With this in mind, let’s break u into vertical and horizontal components in a coordi¬ 
nate system located at the pendulum. From Fig. 9.7, we see that 

u> = tucosflz + wsin#y. (9.19) 

We’ll ignore the y-component, because it produces a Coriolis force in the z direction 
(since v lies in the horizontal x-y plane). So for our purposes, oj is essentially equal 
to lo cos Oz. From this point on, the problem of finding the frequency of precession 
can be done in numerous ways. We’ll present two solutions. 

First solution (The slick way): The horizontal component of the Coriolis force 
has magnitude 

Fcor™ = I — 2rnfy;cos#z) x v| = 2m(w cos ff)v, (9.20) 

and it is perpendicular to v(t). Therefore, as far as the pendulum is concerned, it 
is located at the north pole of a planet called Terra Costhetica which has rotational 
frequency a; cos 6. But as we saw above, the precessional frequency of a Foucault 
pendulum located at the north pole of such a planet is simply 

u) F = wcosO, (9-21) 

in the clockwise direction. So that’s our answer. 7 

Second solution (In the pendulum’s frame): Let’s work in the frame of the 
vertical plane that the Foucault pendulum sweeps through. Our goal is to find the 
rate of precession of this frame. With respect to a frame fixed on the earth (with axes 
x, y, and z), we know that this plane rotates with frequency Wp = —uiz if we’re at 
the north pole (0 = 0), and frequency up = 0 if we’re at the equator (0 = n/2). So 
if there’s any justice in the world, the general answer has got to be u>f = —wcos#z, 
and that’s what we’ll now show. 

Working in the frame of the plane of the pendulum is useful, because we can take 
advantage of the fact that the pendulum feels no sideways forces in this frame, because 
otherwise it would move outside of the plane (which it doesn’t, by definition). 

The frame fixed on the earth rotates with frequency u) = u> cos#z + <usin#y, with 
respect to the inertial frame. Let the pendulum’s plane rotate with frequency u>f = 
lof z with respect to the earth frame. Then the angular velocity of the pendulum’s 
frame with respect to the inertial frame is 

u> + u>f = {cocos 6 + u) F )z + wsin#y. (9.22) 

To find the horizontal component of the Coriolis force in this rotating frame, we 
only care about the z part of this frequency. The horizontal Coriolis force therefore 
has magnitude 2m (w cos# + ui F )v. But in the frame of the pendulum, there is no 
horizontal force, so this must be zero. Therefore, 

u)f = —wcos0. (9.23) 

7 As mentioned above, the setup isn’t exactly like the one on the new planet. There will be a 
vertical component of the Coriolis force for the pendulum on the earth, but this effect is negligible. 
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This agrees with eq. (9.21), where we just wrote down the magnitude of wp. 


9.2.4 Azimuthal force: —m(du>/dt) x r 

In this section, we will restrict ourselves to the simple and intuitive case where u> 
changes only in magnitude (that is, not in direction). * * * 8 In this case, the azimuthal 
force may be written as 

F az = —muu) x r. (9.24) 

This force is easily understood by considering a person standing at rest with respect 
to a rotating carousel. If the carousel speeds up, then the person must feel a tan¬ 
gential friction force at his feet if he is to remain fixed on the carousel. This friction 
force equals mat an , where at an = rCj is the tangential acceleration as measured in the 
ground frame. But from the person’s point of view in the rotating frame, he is not 
moving, so there must be some other mysterious force that balances the friction. 
This is the azimuthal force. Quantitatively, when Cj is orthogonal to r, we have 
|u> x r| = r, so the azimuthal force in eq. (9.24) has magnitude rnrio. This is the 
same as the magnitude of the friction force, as it should be. 

What we have here is exactly the same effect that we had with the translation 
force on the accelerating train. If the floor speeds up beneath you, then you must 
apply a friction force if you don’t want to be thrown backwards with respect to the 
floor. If you shut your eyes and ignore the centrifugal force, then you can’t tell if 
you’re on a linearly accelerating train, or on an angularly accelerating carousel. The 
translation and azimuthal forces both arise from the acceleration of the floor. (Well, 
for that matter, the centrifugal force does, too.) 

We can also view things in terms of rotational quantities, as opposed to the 
linear at an acceleration above. If the carousel speeds up, then a torque must be 
applied to the person if he is to remain fixed on the carousel, because his angular 
momentum in the fixed frame increases. Therefore, he must feel a friction force at 
his feet. 

Let’s show that this friction force, which produces the change in angular momen¬ 
tum of the person in the fixed frame, exactly cancels the azimuthal force in the rotat¬ 
ing frame, thereby yielding zero net force in the rotating frame. 9 Since L = mr 2 uj, 
we have dL/dt = mr 2 u (assuming r is fixed). And since dL/dt = r = rF. we 
see that the required friction force is F = mru. And as we saw above, when u> is 
orthogonal to r, the azimuthal force in eq. (9.24) also equals mru, in the direction 
opposite to the carousel’s motion. 


Example (Spinning ice skater): We have all seen ice skaters increase their angular 

speed by bringing their arms in close to their body. This is easily understood in 

terms of angular momentum; a smaller moment of inertia requires a larger u, to keep 

8 The more complicated case where to changes direction is left for Problem 9. 

9 This is basically the same calculation as the one above, with an extra r thrown in. 
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Figure 9.8 


L constant. But let’s analyze the situation here in terms of fictitious forces. We 
will idealize things by giving the skater massive hands at the end of massless arms 
attached to a massless body. 10 Let the hands have total mass m, and let them be 
drawn in radially. 

Look at things in the skater’s frame (which has an increasing w), defined by the vertical 
plane containing the hands. The crucial thing to realize is that the skater always 
remains in the skater’s frame (a fine tautology, indeed). Therefore, the skater must 
feel zero net tangential force in her frame, because otherwise she would accelerate 
with respect to it. Her hands are being drawn in by a muscular force that works 
against the centrifugal force, but there is no net tangential force on the hands in the 
skater’s frame. 

What are the tangential forces in the skater’s frame? (See Fig. 9.8.) Let the hands be 
drawn in at speed v. Then there is a Coriolis force (in the same direction as the spin¬ 
ning) with magnitude 2 mutv. There is also an azimuthal force with magnitude mril) 
(in the direction opposite the spinning, as you can check). Since the net tangential 
force is zero in the skater’s frame, we must have 

2 mujv = mruj. (9.25) 

Does this relation make sense? Well, let’s look at things in the ground frame. The 
total angular momentum of the hands in the ground frame is constant. Therefore, 
d(rnr 2 u;)/dt = 0. Taking this derivative and using dr/dt = —v gives eq. (9.25). 


A word of advice about using fictitious forces: Decide which frame you are going 
to work in (the lab frame or the accelerated frame), and then stick with it. The 
common mistake is to work a little in one frame and a little on the other, without 
realizing it. For example, you might introduce a centrifugal force on someone sitting 
at rest on a carousel, but then also give her a centripetal acceleration. This is 
incorrect. In the lab frame, there is a centripetal acceleration (caused by the friction 
force) and no centrifugal force. In the rotating frame, there is a centrifugal force 
(which cancels the friction force) and no centripetal acceleration (because the person 
is sitting at rest on the carousel, consistent with the fact that the net force is zero). 
Basically, if you ever mention the words “centrifugal” or “Coriolis”, etc., then you 
had better be working in an accelerated frame. 


■’This reminds me of a joke about a spherical 
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9.3 Exercises 

Section 9.2: The fictitious forces 

1. Magnitude of g e g- * 

What is the magnitude of g e g? Give you answer to the leading-order correction 
in u>. 

2. Corrections to gravity ** 

A mass is dropped from rest from a point directly above the equator. Let the 
initial distance from the center of the earth be R, and let the distance fallen 
be d. If we consider only the centrifugal force, then the correction to g is 
lj 2 (R — d). There is, however, also a second-order Coriolis effect. What is the 
sum of these corrections? 11 

3. Southern deflection ** 

A ball is dropped from a height h (small compared to the radius of the earth), 
at a polar angle 0. How far to the south (in the northern hemisphere) is it 
deflected away from the g e ff direction, by the time it hits the ground? (This 
is a second order Coriolis effect.) 

4. Oscillations across equator * 

A bead lies on a frictionless wire which lies in the north-south direction across 
the equator. The wire takes the form of an arc of a circle; all points are the 
same distance from the center of the earth. The bead is released from rest 
at a short distance from the equator. Because g e g does not point directly 
toward the earth’s center, the bead will head toward the equator and undergo 
oscillatory motion. What is the frequency of these oscillations? 

5. Roche limit * 

Exercise 4.29 dealt with the Roche limit for a particle falling in radially toward 
a planet. Show that the Roche limit for a object in a circular orbit is 

d = R^^) /3 ■ (9.26) 

6. Spinning bucket ** 

An upright bucket of water is spun at frequency u around the vertical axis. If 
the water is at rest with respect to the bucket, find the shape of the water’s 
surface. 

7. Coin on turntable *** 

A coin stands upright at an arbitrary point on a rotating turntable, and rotates 
(without slipping) with the required speed to make its center remain motionless 

11 g will also vary with height, but let’s not worry about that here. 
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in the lab frame. In the frame of the turntable, the coin will roll around in a 
circle with the same frequency as that of the turntable. In the frame of the 
turntable, show that 

(a) F = dp/dt , and 

(b) r = dL/dt. 

8. Precession viewed from rotating frame *** 

Consider a top made of a wheel with all its mass on the rim. A massless rod 
(perpendicular to the plane of the wheel) connects the CM to the pivot. Initial 
conditions have been set up so that the top undergoes precession, with the rod 
always horizontal. 

In the language of Figure 8.27, we may write the angular velocity of the top 
as uj = Liz + u/x 3 (where £3 is horizontal here). Consider things in the frame 
rotating around the z-axis with angular speed fi. In this frame, the top spins 
with angular speed u>' around its fixed symmetry axis. Therefore, in this frame 
r = 0, because there is no change in L. 

Verify explicitly that r = 0 (calculated with respect to the pivot) in this 
rotating frame (you will need to find the relation between u/ and Ll ). In other 
words, show that the torque due to gravity is exactly canceled by the torque do 
to the Coriolis force. (You can easily show that the centrifugal force provides 
no net torque.) 
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9.4 Problems 


Section 9.2: The fictitious forces 

1. g eff VS. g * 

For what 0 is the angle between rag e g- and g maximum? 


2. Longjumping in g eff * 

If a longjumper can jump 8 meters at the north pole, how far can he jump 
at the equator? (Ignore effects of wind resistance, temperature, and runways 
made of ice. And assume that the jump is made in the north-south direction 
at the equator, so that there is no Coriolis force.) 

3. Lots of circles ** 

(a) Two circles in a plane, C\ and C-i , each rotate with frequency u> (relative 
to an inertia frame). See Fig. 9.9. The center of C\ is fixed in an 
inertial frame, and the center of 0% is fixed on C\ . A mass is fixed on 
C 2 . The position of the mass relative to the center of C\ is R(£). Find 
the fictitious force felt by the mass. 

(b) N circles in a plane, C), each rotate with frequency u (relative to an 
inertia frame). See Fig. 9.10. The center of C\ is fixed in an inertial 
frame, and the center of C\ is fixed on Q_ 1 (for i = 2,... ,N). A mass 
is fixed on Cn- The position of the mass relative to the center of C\ is 
R(t). Find the fictitious force felt by the mass. 

4. Which way down? * 

You are floating high up in a balloon, at rest with respect to the earth. Give 
three quasi-reasonable definitions for which point on the ground is right “be¬ 
low” you. 

5. Mass on a turntable * 

A mass rests motionless with respect to the lab frame, while a frictionless 
turntable rotates beneath it. The frequency of the turntable is u>, and the 
mass is located at radius r. In the frame of the turntable, what are the forces 
acting on the mass? 


co 



Figure 9.9 



6. Released mass * 

A mass is bolted down to a frictionless turntable. The frequency of rotation is 
uj, and the mass is located at a radius a. The mass is released. Viewed from 
an inertial frame, it travels in a straight line. In the rotating frame, what path 
does the mass take? Specify r(t) and 9(t), where 9 is the angle with respect 
to the initial radius. 




o»V 
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Figure 9.11 


7. Coriolis circles * 

A puck slides with speed v on frictionless ice. The surface is “level”, in the 
sense that it is orthogonal to g e ff at all points. Show that the puck moves in a 
circle, as seen in the earth’s rotating frame. What is the radius of the circle? 
What is the frequency of the motion? Assume that the radius of the circle is 
small compared to the radius of the earth. 

8. Shape of the earth *** 

The earth bulges slightly at the equator, due to the centrifugal force in the 
earth’s rotating frame. Show that the height of a point on the earth (relative 
to a spherical earth), is given by 

h = R^pj (3 sin 2 9- 2), (9.27) 

where 6 is the polar angle (the angle down from the north pole), and R is the 
radius of the earth. 

9. Changing u>’s direction *** 

Consider the special case where a reference frame’s u> changes only in direction 
(and not in magnitude). In particular, consider a cone rolling on a table, which 
is a natural example of such a situation. 

The instantaneous oj for a rolling cone is its line of contact with the table, be¬ 
cause these are the points that are instantaneously at rest. This line precesses 
around the origin. Let the frequency of the precession be 12. Let the origin of 
the cone frame be the tip of the cone. This point remains fixed in the inertial 
frame. 

In order to isolate the azimuthal force, consider the special case of a point 
P that lies on the instantaneous u) and which is motionless with respect to 
the cone (see Fig. 9.11). From eq. (9.11), we then see that the centrifugal, 
Coriolis, and translation forces are zero. The only remaining fictitious force is 
the azimuthal force, and it arises from the fact that P is accelerating up away 
from the table. 


(a) Find the acceleration of P. 

(b) Calculate the azimuthal force on a mass m located at P, and show that 
the result is consistent with part (a). 
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9.5 Solutions 


1. g eff VS. g 

The forces mg and F cent are shown in Fig. 9.12. The magnitude of F cent is rnRio 2 sin 0, 
so the component of F cent perpendicular to mg is mRu 2 sin 9 cos 9 = mRu 2 (sin 29) /2. 
For small F cent , maximizing the angle between g e ff and g is equivalent to maximizing 
this perpendicular component. Therefore, we obtain the maximum angle when 


(9.28) 


The maximum angle turns out to be (j> ~ sin<f> ss (mRu 2 (sin |)/2 'j/mg = Ru 2 /2g « 
0.0017, which is about 0.1°. For this 6 = 7r/4 case, the line along g e ff misses the 
center of the earth by about 10 km, as you can show. 


Fcent 



Figure 9.12 


Remark: The above method works only when the magnitude of F cent is much smaller than 
mg; we dropped higher order terms in the above calculation. One way of solving the problem 
exactly is to break F cen t into components parallel and perpendicular to g. If <j> is the angle 
between g ei r and g, then from Fig. 9.12 we have 


mg — mRu 2 sin 2 9 


(9.29) 


We can then maximize (f> by taking a derivative. But be careful if Ruj 2 > g, in which case 
maximizing (f> doesn’t mean maximizing tan <p. We’ll let you work this out, and instead we’ll 
give the following slick geometric solution. 

Draw the F cen t vectors, for various 9, relative to mg. The result looks like Fig. 9.13. Since 
the lengths of the F cen t vectors are proportional to sin 9, you can show that the tips of the 
F cent vectors form a circle. The maximum cf> is therefore achieved when g e ff is tangent to 
this circle, as shown in Fig. 9.14. In the limit where g Ruj 2 (that is, in the limit of a 
small circle), we want the point of tangency to be the rightmost point on the circle, so the 
maximum (f> is achieved when 9 = 7t/4, in which case sin0 « (Ru 2 /2)/g. But in the general 
case, Fig. 9.14 shows that the maximum <f> is given by 


mg — \mRu 2 


(9.30) 


In the limit of small oj, this is approximately Rio 2 /2g, as above. 

The above reasoning holds only if Rio 2 < g. In the case where Rio 2 > g (that is, the circle 
extends above the top end of the mg segment), the maximum 0 is simply it, and it is achieved 
at 9 = 7r/2. A 


2. Longjumping in g eff 

Let the jumper take off with speed v, at an angle 6. Then g e s{t/ 2) = v sin 9 tells us 
that the time in the air is t = 2vsin9/g e e. The distance traveled is therefore 


, . 2v 2 sin 9 cos 9 v 2 sin 29 

a = v x t = vt cos 9 = -=- 

<7eff <?eff 


(9.31) 


This is maximum when 9 = 7 t/4 , as we well know. So we see that d oc 1 / ^/g e s- Taking 
g e ff « 10 m/s 2 at the north pole, and <? e ff ~ (10 — 0.03) m/s 2 at the equator, we find 
that the jump at the equator is approximately 1.0015 times as long as the one on the 
north pole. So the longjumper gains about one centimeter. 



Figure 9.13 



Figure 9.14 


Remark: For a longjumper, the optimal angle of takeoff is undoubtedly not 7r/4. The 
act of changing the direction abruptly from horizontal to such a large angle would entail 
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a significant loss in speed. The optimal angle is some hard-to-determine angle less than 
7r/4. But this won’t change our general d oc 1 / y/g e « result (which follows from dimensional 
analysis), so our answer still holds, ft 

3. Lots of circles 

(a) The fictitious force, F f , on the mass has an F cent part and an F trans part, because 
the center of C 2 is moving. So the fictitious force is 

F f = mtv 2 r 2 + F trans , (9.32) 

where r 2 is the position of the mass in the frame of CV But F trans , which arises 
from the acceleration of the center of C 2 , is simply the centrifugal force felt by 
any point on C-\. Therefore, 

Ftrans = TO^V, (9.33) 

where ri is the position of the center of C 2 , in the frame of C\. Substituting 
this into eq. (9.32) gives 

F f = mu> 2 (r2 + ri) 

= mu 2 'R(t). (9.34) 

(b) The fictitious force, Ff, on the mass has an F cent part and an F tran s part, because 
the center of the IVth circle is moving. So the fictitious force is 

F f = muo 2 r N + F tranS)N , (9.35) 

where r N is the position of the mass in the frame of Cn■ But F tra ns,jv is simply 
the centrifugal force felt by a point on the (N — l)st circle, plus the translation 
force coming from the movement of the center of the (N — l)st circle. Therefore, 

Ffrans,« = TTUiJ 2 T N _ 1 + (9.36) 

Substituting this into eq. (9.35) and successively rewriting the F tranSj j terms in 
a similar manner, gives 

Ff = mu> 2 (r N + r N _ 1 -\ -+ iq) 

= mw 2 R(l). (9.37) 

The main point in this problem is that F cent is linear in r. 

Remark: There is actually a much easier way to see that Ff = mw 2 R(l). Since all the 
circles rotate with the same tv, they may as well be glued together. Such a rigid setup would 
indeed yield the same tv for all the circles. This is similar to the moon rotating once on its 
axis for every revolution is makes around the earth, thereby causing the same side to always 
face the earth. It is then clear that the mass simply moves in a circle at frequency tv, yielding 
a fictitious centrifugal force of mw 2 R(t). And we see that the radius R is in fact constant. 
ft 

4. Which way down? 

Here are three possible definitions of the point “below” you on the ground: (1) The 
point that lies along the line between you and the center of the earth, (2) The point 
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where a hanging plumb bob rests, and (3) The point where a dropped object hits the 
ground. 

The second definition is the most reasonable, because it defines the upward direction 
in which buildings are constructed. It differs from the first definition due to the 
centrifugal force which makes g e ff point in a slightly different direction from g. The 
third definition differs from the second due to the Coriolis force. The velocity of the 
falling object produces a Coriolis force which causes an eastward deflection. 

Note that all three definitions are equivalent at the poles. Additionally, definitions 1 
and 2 are equivalent at the equator. 

5. Mass on turntable 

In the lab frame, the net force on the mass is zero, because it is sitting at rest. (The 
normal force cancels the gravitational force.) But in the rotating frame, the mass 
travels in a circle of radius r, with frequency w. So in the rotating frame there must 
be a force of mu 2 r inward to account for the centripetal acceleration. And indeed, the 
mass feels a centrifugal force of muj 2 r outward, and a Coriolis force of 2 muiv = 2» 2 r 
inward, which sum to the desired force (see Fig. 9.15). 

Remark: The net inward force in this problem is a little different from that for someone 
swinging around in a circle in an inertial frame. If a skater maintains a circular path by 
holding onto a rope whose other end is fixed, she has to use her muscles to maintain the 
position of her torso with respect to her arm, and her head with respect to her torso, etc. 
But if a person takes the place of the mass in this problem, she needs to exert no effort to 
keep her body moving in the circle (which is clear, when looked at from the inertial frame), 
because each atom in her body is moving at (essentially) the same speed and radius, and 
therefore feels the same Coriolis and centrifugal forces. So she doesn’t really feel this force, 
in the same sense that someone doesn’t feel gravity when in free-fall with no air resistance. 
* 

6. Released mass 

Let the x'- and y'- axes of the rotating frame coincide with the x- and y- axes of 
the inertial frame at the moment the mass is released (at t = 0). Let the mass 
initially be located on the y'-axis. Then after a time t, the situation looks like that 
in Fig. 9.16. The speed of the mass is v = au>, so it has traveled a distance auit. 
The angle that its position vector makes with the inertial x-axis is therefore tan -1 ut, 
with counterclockwise taken to be positive. Hence, the angle that its position vector 
makes with the rotating y'- axis is O(t) = — (wt — tan -1 u>t). And the radius is simply 
r(t) = a\/l + u)H 2 . So for large t, r(t) w awt and 9(t) « —wt + n/2, which make 
sense. 

7. Coriolis circles 

By construction, the normal force from the ice exactly cancels all effects of the grav¬ 
itational and centrifugal forces in the rotating frame of the earth. We therefore need 
only concern ourselves with the Coriolis force. This force equals F cor = —2mw x v. 
Let the angle down from the north pole be 0; we assume the circle is small enough 
so that 0 is essentially constant throughout the motion. Then the component of the 
Coriolis force that points horizontally along the surface has magnitude / = 2 muiv cos 6 
and is perpendicular to the direction of motion. (The vertical component of the 
Coriolis force will simply modify the required normal force.) Because this force is 
perpendicular to the direction of motion, v does not change. Therefore, / is constant. 



Figure 9.15 


y 

acot t 



Figure 9.16 
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But a constant force perpendicular to the motion of a particle produces a circular 
path. The radius of the circle is given by 

mv 2 v 

2mu)vcos6= - => r = --(9.38) 

r 2u> cos 6 

The frequency of the circular motion is 

J = - = 2<ncos(9. (9.39) 

Remarks: To get a rough idea of the size of the circle, you can show (using uj « 7.3-10 -5 s -1 ) 
that r « 10 km when v = lm/s and 0 = 45°. Even the tiniest bit of friction will clearly 
make this effect essentially impossible to see. 

For the 6 « 7t/2 (that is, near the equator), the component of the Coriolis force along the 
surface is negligible, so r becomes large, and u/ goes to 0. 

For the 6 « 0 (that is, near the north pole), the Coriolis force essentially points along the 
surface. The above equations give r « v/(2ui), and u/ « 2u>. For the special case where the 
center of the circle is the north pole, this u' « 2u> result might seem incorrect, because you 
might want to say that the circular motion should be achieved by having the puck remain 
motionless in the inertial fame, while the earth rotates beneath it (thus making oJ m u>). 
The error in this reasoning is that the “level” earth is not spherical, due to the non-radial 
direction of g e ff. If the puck starts out motionless in the inertial frame, it will be drawn 
toward the north pole, due to the component of the gravitational force along the “level” 
surface. In order to not fall toward the pole, the puck needs to travel with frequency u> 
(relative to the inertial frame) in the direction opposite 12 to the earth’s rotation. 13 The 
puck therefore moves at frequency 2u> relative to the frame of the earth, ft 
8. Shape of the earth 

In the reference frame of the earth, the forces on an atom at the surface are: earth’s 
gravity, the centrifugal force, and the normal force from the ground below it. These 
three forces must sum to zero. Therefore, the sum of the gravity plus centrifugal forces 
must be normal to the surface. Said differently, the gravity-plus-centrifugal force must 
have no component along the surface. Said in yet another way, the potential energy 
function derived from the gravity-plus-centrifugal force must be constant along the 
surface. (Otherwise, a piece of the earth would want to move along the surface, which 
would mean we didn’t have the correct surface to begin with.) 

If x is the distance from the earth’s axis, then the centrifugal force is F c = mui 2 x, 
directed outward. The potential energy function for this force is V c = —muj 2 x 2 / 2, 
up to an arbitrary additive constant. The potential energy for the earth’s gravitation 
force is simply mgh. (We’ve arbitrarily chosen the original spherical surface have zero 
potential; any other choice would add on an irrelevant constant. Also, we’ve assumed 
that the slight distortion of the earth won’t make the mgh result invalid. This is true 
to lowest order in h/R, which you can demonstrate if you wish.) 

The equal-potential condition is therefore 

mgh - = C, (9.40) 

12 Of course, the puck could also move with frequency ui in the same direction as the earth’s 
rotation. But in this case, the puck simply sits at one place on the earth. 

13 The reason for this is the following. In the rotating frame of the puck, the puck feels the 
same centrifugal force that it would feel if it were at rest on the earth, spinning with it. The puck 
therefore happily stays at the same 9 value on the “level” surface, just as a puck at rest on the 
earth does. 
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where C is a constant to be determined. Using x = 
ui 2 r 2 sin 2 9 , 


(9.41) 


where B = C/(mg) is another constant. We may replace the r here with the radius 
of the earth, R, with negligible error. 

Depending what the constant B is, this equation describes a whole family of surfaces. 
We may determine the correct value of B by demanding that the volume of the earth 
be the same as it would be in its spherical shape if the centrifugal force were turned 
off. This is equivalent to demanding that the integral of h over the surface of the 
earth is zero. The integral of (a sin 2 6 + b) over the surface of the earth is (the integral 
is easy if we write sin 2 6 as 1 - cos 2 6) 

J (a(l — cos 2 6) + b) 2 -kR 2 sin6d9 = j ( — acos 2 6 + (a + b)) 2 -kR 2 sinOdO 
= 2t tR 2 ^ aco ^ e _ ( a + b) cosfl) [ 

= 2nR 2 (-^+2(a + b)y (9.42) 


Hence, we need b = —(2/3) a for this integral to be zero. Plugging this result into eq. 
(9.41) gives 

h = R^j?- > j( 3 sin 2 <9-2), (9.43) 

as desired. 

9. Changing u>’s direction 

(a) Let Q be the point which lies on the axis of the cone and which is directly above 
P (see Fig. 9.17). If P is a distance r from the origin, and if the half-angle of 
the cone is /?, then Q is a height y = r tan (3 above P. 

Consider the situation an infinitesimal time t later. Let P' be the point that 
is now directly below Q (see Fig. 9.17). The angular speed of the cone is oj, 
so Q moves horizontally at a speed v Q = ioy = car tan 0. Therefore, in the 
infinitesimal time t, we see that Q moves a distance ojyt to the side. 

This distance ojyt is also (essentially) the horizontal distance between P and P'. 
Therefore, a little geometry tells us that P is now a distance 

m = y - Vy 2 -(uyt) 2 « = i(o, 2 y)t 2 (9.44) 

above the table. Since P started on the table with zero speed, this means that P 
is undergoing an acceleration of u> 2 y in the vertical direction. A mass m located 
at P must therefore feel a force Fp = mu> 2 y in the upward direction, if it is to 
remain motionless with respect to the cone. 

(b) The precession frequency fl (which is how fast oj swings around the origin) is 
equal to the speed of Q divided by r. This is true because Q is always directly 
above w, so it moves in a circle of radius r around the 2 -axis. Therefore, L2 
has magnitude v Q /r = ojy/r, and it points in the downward vertical direction 
(for the situation shown in Fig. 9.11). Hence, doj/dt = fl x u> has magnitude 
uJ 2 y/r, and it points in the horizontal direction (out of the page). Therefore, 
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F az = — m(du>/dt ) x r has magnitude mu 2 y, and it points in the downward 
vertical direction. 

A person of mass m at point P therefore interprets the situation as, “I am not 
accelerating with respect to the cone. Therefore, the net force on me is zero. 
And indeed, the upward normal force Fp from the cone, with magnitude mto 2 y, 
is exactly balanced by the mysterious downward force F az , also with magnitude 
mu> 2 y.” 



9.5. SOLUTIONS 
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Chapter 10 

Relativity (Kinematics) 

Copyright 2004 by David Morin, morin@physics.harvard.edu 


We now come to Einstein’s theory of relativity. This is where we find out that 
everything we’ve done so far in this book has been wrong. Well, perhaps “incom¬ 
plete” would be a better word. The important point to realize is that Newtonian 
physics is a limiting case of the more correct Relativistic theory. Newtonian physics 
works perfectly fine when the speeds you are dealing with are much less than the 
speed of light (which is about 3 • 10 8 m/s). It would be silly, to put it mildly, to use 
relativity to solve a problem involving the length of a baseball trajectory. But in 
problems involving large speeds, or in problems where a high degree of accuracy is 
required, you must use the Relativistic theory. This will be the topic of the next 
four chapters. 1 

The theory of Relativity is certainly one of the most exciting and talked-about 
topics in physics. It is well-known for its “paradoxes”, which are quite conducive 
to discussion. There is, however, nothing at all paradoxical about it. The the¬ 
ory is logically and experimentally sound, and the whole subject is actually quite 
straightforward, provided that you proceed calmly and keep a firm hold of your wits. 

The theory rests upon certain postulates. The one that most people find coun¬ 
terintuitive is that the speed of light has the same value in any inertial (that is, 
non-accelerating) reference frame. This speed is much greater than the speed of 
everyday objects, so most of the consequences of this new theory are not noticeable. 
If we lived in a world similar to ours, with the only difference being that the speed 
of light were 100 mph, then the consequences of relativity would be ubiquitous. We 
wouldn’t think twice about time dilations, length contractions, and so on. 

I have included a large number of puzzles and “paradoxes” in the text and in 
the problems. When attacking these, be sure to follow them through to completion, 
and do not say, “I could finish this one if I wanted to, but all I’d have to do would 
be such-and-such, so I won’t bother,” because the essence of the paradox may very 

1 At any rate, you shouldn’t feel too bad about having spent so much time learning about a 
theory that is simply the limiting case of another theory, because you’re now going to do it again. 
Relativity is also the limiting case of yet another theory (quantum field theory). And likewise, you 
shouldn’t feel too bad about spending so much time on relativity, because quantum field theory is 
also the limiting case of yet another theory (string theory). And likewise... well, you get the idea. 
It just might actually be turtles all the way down. 
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well be contained in the “such-and-such”, and you will have missed out on all the 
fun. Most of the paradoxes arise because different frames of reference seem to give 
different answers. Therefore, in explaining a paradox, you not only have to give the 
correct reasoning; you also have to say what’s wrong with incorrect reasoning. 

There are two main topics in relativity. One is Special Relativity (which does 
not deal with gravity), and the other is General Relativity (which does). We will 
deal mostly with the former, but Chapter 13 contains some of the latter. 

Special Relativity may be divided into two topics, kinematics and dynamics. 
Kinematics deals with lengths, times, speeds, etc. It is basically concerned with 
only the space and time coordinates of an abstract particle, and not with masses, 
forces, energy, momentum, etc. Dynamics, on the other hand, does deal with these 
quantities. 

This chapter will cover kinematics. Chapter 11 will cover dynamics. Most of 
the fun paradoxes fall into the kinematics part, so the present chapter will be the 
longer of the two. In Chapter 12, we will introduce the concept of 4-vectors, which 
ties much of the material in Chapters 10 and 11 together. 

10.1 The postulates 

Various approaches can be taken in deriving the consequences of the Special Rel¬ 
ativity theory. Different approaches use different postulates. Some start with the 
invariance of the speed of light in any inertial frame. Others start with the invariant 
interval (discussed in Section 10.4). Others start with the invariance of the inner 
product of 4-momentum vectors (discussed in Chapter 12). Postulates in one ap¬ 
proach are theorems in another. There is no “good” or “bad” route to take; they 
are all equally valid. However, some approaches are simpler and more intuitive (if 
there is such a thing as intuition in relativity) than others. I will choose to start 
with the speed-of-light postulate: 

• The speed of light has the same value in any inertial frame. 

I do not claim that this statement is obvious, or even believable. But I do claim 
that it is easy to understand what the statement says (even if you think it’s too 
silly to be true). It says the following. Consider a train moving along the ground at 
constant velocity (that is, it is not accelerating; this is the definition of an inertial 
frame). Someone on the train shines a light from one point on the train to another. 
Let the speed of the light with respect to the train be c (« 3 • 10 8 m/s). Then the 
above postulate says that a person on the ground also sees the light move at speed 
c. 

This is a rather bizarre statement. It does not hold for everyday objects. If a 
baseball is thrown on a train, then the speed of the baseball is of course different 
in different frames. The observer on the ground must add the velocity of the train 
and the velocity of the ball (with respect to the train) to obtain the velocity of the 
ball with respect to the ground. 2 

2 Actually, this isn’t quite true, as the velocity-addition result in Section 10.3.3 shows. But it’s 
true enough for the point we are making here. 
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The truth of the speed-of-light postulate cannot be demonstrated from first 
principles. No statement with any physical content in physics (that is, one that isn’t 
purely mathematical, such as, “two apples plus two apples gives four apples”) can be 
proven. In the end, we must rely on experiment. And indeed, all the consequences 
of the speed-of-light postulate have been verified countless times during this century. 
In particular, the consequences are being verified continually each day in high-energy 
particle accelerators, where elementary particles reach speeds very close to c. 

The most well-known of the early experiments on the speed of light was the one 
performed by Michelson and Morley, who in 1887 tried to measure the effect of the 
earth’s motion on the speed of light. If light moves at speed c with respect to only 
one special frame (the frame of the “ether”, analogous to the way sound travels 
through air), then the speed of light in a given direction on the earth should be 
faster or slower than c, depending on which way the earth is moving through the 
ether. In particular, if the speed of light in a given direction is measured at one 
time, and then measured again six months later, then the results should be different, 
due to the earth’s motion around the sun. Michelson and Morley were not able to 
measure any such differences in the speed of light. Nor has anyone else been able 
to do so. The conclusion is that the ether simply does not exist. Light does not 
require a medium to support its motion. 

The findings of Michelson-Morley 
Allow us to say very surely, 

“If this ether is real, 

Then it has no appeal, 

And shows itself off rather poorly.” 

The collection of all the data from various experiments over many years allows us 
to conclude with reasonable certainty that our starting assumption of an invariant 
speed of light is correct (or is at least the limiting case of a more accurate theory). 

There is one more postulate in the Special Relativity theory, namely the “Rel¬ 
ativity” postulate. It is much more believable than the speed-of-light postulate, so 
you might just take it for granted and forget to consider it. But like any postulate, 
of course, it is crucial. It can be stated in various ways, but we’ll simply word it as: 

• All inertial frames are “equivalent”. 

This postulate basically says that one inertial frame is no better than any an¬ 
other. There is no preferred reference frame. That is, it makes no sense to say 
that something is moving; it only makes sense to say that one thing is moving with 
respect to another. This is where the “Relativity” in Special Relativity comes from. 
There is no absolute frame; the motion of any frame is only defined relative to other 
frames. 

This postulate also says that if the laws of physics hold in one inertial frame 
(and presumably they do hold in the frame in which I now sit), 3 then they hold 

technically, the earth is spinning while revolving around the sun, and there are also little 
vibrations in the floor beneath my chair, etc., so I’m not really in an inertial frame. But it’s close 
enough for me. 
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in all others. It also says that if we have two frames S and S', then S should see 
things in S' in exactly the same way that S' sees things in S (because we could 
simply switch the labels of S and S '). It also says that empty space is homogeneous 
(that is, all points look the same), because we could pick any point to be, say, the 
origin of a coordinate system. It also says that empty space is isotropic (that is, all 
directions look the same), because we could pick any axis to be, say, the x-axis of a 
coordinate system. 

Unlike the first postulate, this one is entirely reasonable. We’ve gotten used to 
having no special places in the universe. We gave up having the earth as the center, 
so let’s not give any other point a chance, either. 

Copernicus gave his reply 
To those who had pledged to deny. 

“All your addictions 

To ancient convictions 

Won’t bring back your place in the sky.” 

Everything we’ve said here about our second postulate refers to empty space. If 
we have a chunk of mass, then there is certainly a difference between the position 
of the mass and a point a meter away. To incorporate mass into the theory, we 
would have to delve into General Relativity. But we won’t have anything to say 
about that in this chapter. We will deal only with empty space, containing perhaps 
a few observant souls sailing along in rockets or floating aimlessly on little spheres. 
Though it may sound boring at first, it will turn out to be more exciting than you’d 
think. 


10.2 The fundamental effects 
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Figure 10.1 
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Figure 10.2 


The most striking effects of our two postulates are: (1) the loss of simultaneity, (2) 
length contraction, and (3) time dilation. In this section, we will discuss these three 
effects using some time-honored concrete examples. In the following section, we will 
derive the Lorentz transformations using these three results. 

10.2.1 Loss of Simultaneity 

Consider the following setup. In A’s reference frame, a light source is placed midway 
between two receivers, a distance £' from each (see Fig. 10.1). The light source 
emits a flash. From A’s point of view, the light hits the two receivers at the same 
time, d/<: seconds after the flash. Now consider another observer, B. who travels 
by to the left at speed v. From her point of view, does the light hit the receivers at 
the same time? We will show that it does not. 

In B’s reference frame, the situation looks like that in Fig. 10.2. The receivers 
(along with everything else in A’s frame) are moving to the right at speed v. and the 
light is traveling in both directions at speed c with respect to B (not with respect to 
the light source, as measured in B 's frame; this is where the speed-of-light postulate 






10.2. THE FUNDAMENTAL EFFECTS 


X-5 


comes into play). Therefore, the relative speed (as viewed by B ) of the light and 
the left receiver is c + v, and the relative speed of the light and the right receiver is 
c — v. 4 

Let £ be the distance from the source to the receivers, as measured by B . 5 Then 
in B 's frame, the light hits the left receiver at ti and the right receiver at t r . where 


tl 


i 

c + v ’ 


t r 


£ 

c — v 


( 10 . 1 ) 


These are not equal if v ^ 0. 6 

The moral of this exercise is that it makes no sense to say that one event happens 
at the same time as another, unless you state which frame you’re talking about. 
Simultaneity depends on the frame in which the observations are made. 


Of the many effects, miscellaneous, 
The loss of events, simultaneous, 
Allows A to claim 
There’s no pause in B's frame, 


Remarks: 

1. The invariance of the speed of light was used in saying that the two relative speeds 
above were c + v and c — v. If we were talking about baseballs instead of light 
beams, then the relative speeds wouldn’t look like this. If v b is the speed at which 
the baseballs are thrown in A’s frame, then B sees the balls move at speeds v b — v to 
the left and v b + v to the right. 7 These are not equal to v b , as they would be in the 
case of the light beams. The relative speeds between the balls and the left and right 
receivers are therefore (v b — v) + v = v b and (v b + v) — v = v b . These are equal, so B 
sees the balls hit the receivers at the same time, as we know very well from everyday 
experience. 

2. It is indeed legal in eq. (10.1) to obtain the times by simply dividing l by the relative 
speeds, 6 + v and c—v. But if you want a more formal method, then consider the 
following. In B's frame, the position of the right photon is given by ct, and the position 

4 Yes, it is legal to simply add and subtract these speeds to obtain the relative speeds as viewed 
by B. If v equals, say, 2 • 10 8 m/s, then in one second the left receiver will move 2 • 10 8 m to the 
right, while the left ray of light will move 3 • 10 s m to the left. This means that they will now be 

5 • 10 s m closer than they were a second ago. In other words, the relative speed (as measured by B ) 
is 5 • 10® m/s, which is simply c + v. Note that this implies that it is perfectly legal for the relative 
speed of two things, as measured by a third, to take any value up to 2c. Both the v and c here are 
measured with respect to the same person, namely B, so our intuition works fine. We don’t need 
to use the “velocity-addition formula”, which we’ll derive in Section 10.3.3, and which is relevant in 
a different setup. I include this footnote here just in case you’ve seen the velocity-addition formula 
and think it is relevant in this setup. But if it didn’t occur to you, then never mind. 

5 We will see in Section 10.2.3 that l is not equal to (!, due to length contraction. But this 
won’t be important here. The only thing we assume now is that the light source is equidistant 
from the receivers. This follows from the fact that space is homogeneous, which implies that the 
length-contraction factor must be the same everywhere. More on this in Section 10.2.3. 

6 The one exception is when t = 0, in which case the two events happen at the same place and 
same time in all frames. 

7 The velocity-addition formula in Section 10.3.3 shows that these formulas aren’t actually cor¬ 

rect. But they’re close enough for our purposes here. 
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of the right receiver (which had a head start of €) is given by £ + vt. Equating these 
two positions gives t r = l/(c — v). Likewise for the left photon. 

3. There is always a difference between the time an event happens and the time someone 
sees the event happen, because light takes time to travel from the event to the observer. 
What we calculated above were the times at which the events really happen. We could, 
of course, calculate the times at which B sees the events occur, but such times are 
rarely important, and in general we will not be concerned with them. They can simply 
be calculated by adding on a (distance)/c time difference for the path of the photons 
to B’s eye. Of course, if B did the above experiment to find t r and tj, she would do 
it by writing down the times at which she saw the events occur, and then subtracting 
off the relevant (distance) /c time differences to find when the events really happened. 
To sum up, the t r A U result in eq. (10.1) is due to the fact that the events truly 
occur at different times. It has nothing to do with the time it takes light to travel to 
your eye. In this chapter, we will sometimes use sloppy language of the sort, “What 
time does Beth see event Q happen?” But we don’t really mean, “When do Beth’s 
eyes register that Q happened?” Instead, we mean, “What time does Beth know that 
event Q happened in her frame?” If we ever want to use “see” in the former sense, 
we will explicitly say so (as in Section 10.6 on the Doppler effect), £ 

Where this last line is not so extraneous. 
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Figure 10.3 
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Figure 10.4 


Example (The head start): Two clocks are positioned at the ends of a train of 
length L (as measured in its own frame). They are synchronized in the train frame. 
The train travels past you at speed v. It turns out that if you observe the clocks at 
simultaneous times in your frame, you will see the rear clock showing a higher reading 
than the front clock (see Fig. 10.3). By how much? 

Solution: As above, let’s put a light source on the train, but let’s now position it 
so that the light hits the clocks at the ends of the train at the same time in your 
frame. As above, the relative speeds of the photons and the clocks are c + v and 
c — v (as viewed in your frame). We therefore need to divide the train into lengths in 
this ratio, in your frame. But since length contraction (discussed in Section 10.2.3) 
is independent of position, this must also be the ratio in the train frame. So in the 
train frame, you can quickly show that two numbers that are in this ratio, and that 
add up to L, are L(c + v)/2c and L(c — v)/2c. 

The situation in the train frame therefore looks like that in Fig. 10.4 The light must 
travel an extra distance of L(c + v)/2c — L(c — v)/2c = Lv/c to reach the rear clock. 
The extra time is therefore Lv/c 2 . Hence, the rear clock reads Lv/c 2 more when it 
is hit by the backward photon, compared to what the front clock reads when it is hit 
by the forward photon. 

Now, let the instant you look at the clocks be the instant the photons hit them (that’s 
why we chose the hittings to be simultaneous in your frame). Then you observe the 
rear clock reading more than the front clock by an amount, 

Lv 

(difference in readings) = . (10-2) 

Note that the L that appears here is the length of the train in its own frame, and not 
the shortened length that you observe in your frame (see Section 10.2.3). 
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Remarks: The fact that the rear clock is ahead of the front clock in your frame means that 
the light hits the rear clock after it hits the front clock in the train frame. 

Note that our result does not say that you see the rear clock ticking at a faster rate than the 
front clock. They run at the same rate. (Both have the same time-dilation factor relative to 
you; see Section 10.2.2.) The back clock is simply a fixed time ahead of the front clock, as 
seen by you. 

It’s easy to forget which of the clocks is the one that’s ahead. But a helpful mnemonic for 
remembering “rear clock ahead” is that both the first and fourth letters in each word form 
the same acronym, “rca,” which is an anagram for “car,” which is sort of like a train. Sure. 

* 


10.2.2 Time dilation 

We present here the classic example of a light beam traveling vertically on a train. 
Let there be a light source on the floor of the train and a mirror on the ceiling, 
which is a height h above the floor. Let observer A be on the train, and observer B 
be on the ground. The speed of the train with respect to the ground is v. 8 A flash 
of light is emitted. The light travels up to the mirror and then back down to the 
source. 

In A’s frame, the train is at rest. The path of the light is shown in Fig. 10.5. It 
takes the light a time h/c to reach the ceiling and then a time h/c to return to the 
source. The roundtrip time is therefore 


In B s frame, the train moves at speed v. The path of the light is shown in 
Fig. 10.6. The crucial fact to remember is that the speed of light in B's frame is still 
c. This means that the light travels along its diagonally upward path at speed c. 
(The vertical component of the speed is not c, as would be the case if light behaved 
like a baseball.) Since the horizontal component of the light’s velocity is v, 9 the 
vertical component must be 5 r(? —viMI, as shown in Fig. 10.7. 10 The time it takes 
to reach the mirror is therefore h/y <c? — u 2 , 11 so the roundtrip time is 



technically, the words, “with respect to...should always be included when talking about 
speeds, because there is no absolute reference frame, and hence no absolute speed. But in the 
future, when it is clear what we mean (as in the case of a train moving with respect to the ground), 
we will occasionally be sloppy and drop the “with respect to....” 

9 Yes, it is still v. The light is always located on the vertical line between the source and the 
mirror. Since both of these move horizontally at speed v, the light does also. 

10 The Pythagorean theorem is indeed valid here. It is valid for distances, and since speeds are 
simply distances divided by time, it is also valid for speeds. 

11 We’ve assumed that the height of the train in B’s frame is still h. Although we will see in 
Section 10.2.3 that there is length contraction along the direction of motion, there is none in the 
direction perpendicular to the motion (see Problem 1). 
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Dividing eq. (10.4) by eq. (10.3) gives 


t B = it A , 


(10.5) 


where 


1 

7= v/l-oVc 2 ’ 


( 10 . 6 ) 


This 7 factor is ubiquitous in special relativity. Note that it is always greater than 
or equal to 1. This means that the roundtrip time is longer in B's frame than in 
A’s frame. 

What are the implications of this? For concreteness, let v/c = 3/5, which 
gives 7 = 5/4. Then we may say the following. If A is standing next to the light 
source, and if B is standing on the ground, and if A claps his hands at t A = 4 
second intervals, then B will observe claps at t B = 5 second intervals (after having 
subtracted off the time for the light to travel to her eye, of course). This is true 
because both A and B must agree on the number of roundtrips the light beam takes 
between claps. Assuming, for convenience, that a roundtrip takes one second in A’s 
frame, the four roundtrips between claps will take five seconds in F?’s frame, using 
eq. (10.5). And if we have a train that does not contain one of our special clocks, 
that’s no matter. We could have built one if we wanted to, so the same results 
concerning the claps must hold. 

Therefore, B will observe A moving strangely slowly. B will observe A’s heart¬ 
beat beating slowly; his blinks will be a bit lethargic; and his sips of coffee will be 
slow enough to suggest that he needs another cup. 


The effects of dilation of time 
Are magical, strange, and sublime. 

In your frame, this verse, 

Which you’ll see is not terse, 

Can be read in the same amount of time it takes someone 
else in another frame to read a similar sort of rhyme. 


Note that we can make these conclusions only if A is at rest with respect to the 
train. If A is moving with respect to the train, then eq. (10.5) does not hold, because 
we cannot say that both A and B must agree on the number of roundtrips the light 
beam takes between claps, because of the problem of simultaneity. It cannot be said 
which flash of light happens at the same time as a clap. Such a statement depends 
on the frame. 

Remarks: 

1. The time dilation result derived in eq. (10.5) is a bit strange, no doubt, but there 
doesn’t seem to be anything downright incorrect about it until we look at the situation 
from A’s point of view. A sees B flying by at a speed v in the other direction. The 
ground is no more fundamental than a train, so the same reasoning applies. The time 
dilation factor, 7 , doesn’t depend on the sign of v, so A sees the same time dilation 
factor that B sees. That is, A sees R’s clock running slow. But how can this be? Are 
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we claiming that A’s clock is slower than B’s, and also that B’s clock is slower than 
A’s? Well... yes and no. 

Remember that the above time-dilation reasoning applies only to a situation where 
something is motionless in the appropriate frame. In the second situation (where A 
sees B flying by), the statement t A = 7 t B holds only when the events happen at the 
same place in B’ s frame. But for two such events, they are not in the same place in 
A’s frame, so the t B = ~jt A result of eq. (10.5) does not hold. The conditions of being 
motionless in each frame never both hold for a given setup (unless v = 0 , in which 
case 7=1 and t A = t B ). So, the answer to the question at the end of the previous 
paragraph is “yes” if you ask the questions in the appropriate frames, and “no” if you 
think the answer should be frame independent. 

2. Concerning the fact that A sees B’s clock run slow, and B sees A’s clock run slow, 
consider the following statement. “This is a contradiction. It is essentially the same 
as saying, ‘I have two apples on a table. The left one is bigger than the right one, 
and the right one is bigger than the left one.’ ” How would you respond to this? 
Well, it is not a contradiction. Observers A and B are using different coordinates to 
measure time. The times measured in each of their frames are quite different things. 
The seemingly contradictory time-dilation result is really no stranger than having two 
people run away from each other into the distance, and having them both say that 
the other person looks smaller. 

In short, we are not comparing apples and apples. We are comparing apples and 
oranges. A more correct analogy would be the following. An apple and an orange sit 
on a table. The apple says to the orange, “You are a much uglier apple than I am,” 
and the orange says to the apple, “You are a much uglier orange than I am.” 


3. One might view the statement, “A sees B’s clock running slowly, and also B sees A’s 
clock running slowly,” as somewhat unsettling. But in fact, it would be a complete 
disaster for the theory if A and B viewed each other in different ways. A critical 
postulate in the theory of relativity is that A sees B in exactly the same way as B 
sees A. * 


Example (Twin paradox): Twin A stays on the earth, while twin B flies quickly 
to a distant star and back (see Fig. 10.8). Show that B is younger when they meet 
up again. 



earth star 


Figure 10.8 


Solution: From A’s point of view, B’s clock is running slow by a factor 7 , on both 
the outward and return parts of the trip. Therefore, B is younger when they meet up 
again. 

That’s all there is to it. But although the above reasoning is correct, it leaves one 
large point unaddressed. The “paradox” part of this problem’s title comes from the 
following reasoning. You might say that in B’s frame, A’s clock is running slow by a 
factor 7 , and so A is younger when they meet up again. 

It is definitely true that when the two twins are standing next to each other (that 
is, when they are in the same frame), we can’t have both B younger than A, and A 
younger than B. So what is wrong with the reasoning in the previous paragraph? 
The error lies in the fact that there is no “one frame” that B is in. The inertial 
frame for the outward trip is different from the inertial frame for the return trip. The 
derivation of our time-dilation result applies only to one inertial frame. 
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Said in a different way, B accelerates when she turns around, and our time-dilation 
result holds only from the point of view of an inertial observer. 12 The symmetry in 
the problem is broken by the acceleration. If both A and B are blindfolded, they 
can still tell who is doing the traveling, because B will feel the acceleration at the 
turnaround. Constant velocity cannot be felt, but acceleration can be. (However, see 
Chapter 13 on General Relativity. Gravity complicates things.) 

The above paragraphs show what is wrong with the “A is younger” reasoning, but 
it doesn’t show how to modify it quantitatively to obtain the correct answer. There 
are many different ways of doing this, and I’ll let you tackle some of these in the 
problems. Also, Appendix K gives a list of all the possible resolutions to the twin 
paradox that I can think of. 


Example (Muon decay): Elementary particles called muons (which are identical 
to electrons, except that they are about 200 times as massive) are created in the upper 
atmosphere when cosmic rays collide with air molecules. The muons have an average 
lifetime of about 2 • 10 -6 seconds 13 (then they decay into electrons, neutrinos, and 
the like), and move at nearly the speed of light. 

Assume for simplicity that a certain muon is created at a height of 50 km, moves 
straight downward, has a speed v = .99998 c, decays in exactly T = 2 • 10 -6 seconds, 
and doesn’t collide with anything on the way down. 14 Will the muon reach the earth 
before it decays? 

Solution: The naive thing to say is that the distance traveled by the muon is 
d = vT ss (3 • 10 8 m/s)(2 • 10 _6 s) = 600m, and that this is less than 50km, so the 
muon does not reach the earth. This reasoning is incorrect, because of the time- 
dilation effect. The muon lives longer in the earth frame, by a factor of 7, which is 
7 = l/y'T^IP160 here. The correct distance traveled in the earth frame is 
therefore v(yT) « 100 km. Hence, the muon travels the 50 km, with room to spare. 
The real-life fact that we actually do detect muons reaching the surface of the earth 
in the predicted abundances (while the naive d = vT reasoning would predict that we 
shouldn’t see any) is one of the many experimental tests that support the relativity 
theory. 


10.2.3 Length contraction 

Consider the following setup. Person A stands on a train which he measures to 
have length , and person B stands on the ground. A light source is located at the 

12 For the entire outward and return parts of the trip, B does observe A’s clock running slow, 
but enough strangeness occurs during the turning-around period to make A end up older. Note, 
however, that a discussion of acceleration is not required to quantitatively understand the paradox, 
as Problem 2 shows. 

13 This is the “proper” lifetime. That is, the lifetime as measured in the frame of the muon. 

14 In the real world, the muons are created at various heights, move in different directions, have 
different speeds, decay in lifetimes that vary according to a standard half-life formula, and may 
very well bump into air molecules. So technically we’ve got everything wrong here. But that’s no 
matter. This example will work just fine for the present purpose. 
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back of the train, and a mirror is located at the front. The train moves at speed 
v with respect to the ground. The source emits a flash of light which heads to the 
mirror, bounces off, then heads back to the source. By looking at how long this 
process takes in the two reference frames, we can determine the length of the train, 
as viewed by B. 15 

In T’s frame (see Fig. 10.9), the round-trip time for the light is simply 


u 


A's frame 


t A = 


2T_ 

c 


(10.7) 


Figure 10.9 


Things are a little more complicated in B 's frame (see Fig. 10.10). Let the 
length of the train, as viewed by B, be l. For all we know at this point, i may 
equal l', but we will soon find that it does not. The relative speed of the light and 
the mirror during the first part of the trip is c — v. The relative speed during the 
second part is c + v. During each part, the light must close a gap with initial length 
i. Therefore, the total round-trip time is 


l l _ 2lc _ 21 2 

c — v c + v c 2 — V 2 c 7 


( 10 . 8 ) 


But we know from eq. (10.5) that 


t B = it A . 


(10.9) 


Substituting the results for t A and t B from eqs. (10.7) and (10.8) into eq. (10.9), 
we find 



( 10 . 10 ) 



Figure 10.10 


Since 7 > 1, we see that B measures the train to be shorter than A measures. 

Note that we could not have used this setup to find the length contraction if we 
had not already found the time dilation in eq. (10.5). 

The term proper length is used to describe the length of an object in its rest 
frame. So £' is the proper length of the above train. 


Relativistic limericks have the attraction 
Of being shrunk by a Lorentz contraction. 
But for readers, unwary, 

The results may be scary, 

When a fraction... 


Remarks: 

1. The length-contraction result in eq. (10.10) is for lengths along the direction of the 
relative velocity. There is no length contraction in the perpendicular direction, as 
shown in Problem 1. 

ls The second remark below gives another (quicker) derivation of length contraction. But we’ll 
go through with the present derivation since the calculation is instructive. 
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Figure 10.12 


2 . As with time dilation, this length contraction is a bit strange, but there doesn’t seem 
to be anything actually paradoxical about it, until we look at things from A’s point 
of view. To make a nice symmetrical situation, let’s say B is standing on an identical 
train, which is motionless with respect to the ground. A sees B flying by at speed v 
in the other direction. Neither train is any more fundamental than the other, so the 
same reasoning applies, and A sees the same length contraction factor that B sees. 
That is, A measures B’s train to be short. 

But how can this be? Are we claiming that A’s train is shorter than B’s, and also 
that B’s train is shorter than A’s? Does the situation look like Fig. 10.11, or does it 
look like Fig. 10.12? Well... it depends. 

The word “is” in the above paragraph is a very bad word to use, and is generally 
the cause of all the confusion. There is no such thing as “is-ness” when it comes to 
lengths. It makes no sense to say what the length of the train really is. It only makes 
sense to say what the length is in a given frame. The situation doesn’t really look like 
one thing in particular. The look depends on the frame in which the looking is being 
done. 

Let’s be a little more specific. How do you measure a length? You write down the 
coordinates of the ends of something measured simultaneously , and then you take the 
difference. But the word “simultaneous” here should send up all sorts of red flags. 
Simultaneous events in one frame are not simultaneous events in another. 

Stated more precisely, here is what we are really claiming: Let B write down simul¬ 
taneous coordinates of the ends of A’s train, and also simultaneous coordinates of the 
ends of her own train. Then the difference between the former is smaller than the 
difference between the latter. Likewise, let A write down simultaneous coordinates 
of the ends of B’s train, and also simultaneous coordinates of the ends of his own 
train. Then the difference between the former is smaller than the difference between 
the latter. There is no contradiction here, because the times at which A and B are 
writing down the coordinates don’t have much to do with each other, due to the loss 
of simultaneity. As with time dilation, we are comparing apples and oranges. 

3. There is an easy argument to show that time dilation implies length contraction, and 
vice versa. Let B stand on the ground, next to a stick of length £. Let A fly past 
the stick at speed v. In B’s frame, it takes A a time of £/v to traverse the length of 
the stick. Therefore (assuming that we have demonstrated the time-dilation result), 
a watch on A’s wrist will advance by a time of only £ / 71 : while he traverses the length 
of the stick. 

How does A view the situation? He sees the ground and the stick fly by with speed v. 
The time between the two ends passing him is £/ / yv (since that is the time elapsed on 
his watch). To get the length of the stick in his frame, he simply multiplies the speed 
times the time. That is, he measures the length to be (i/'yvjv = f/ 7 , which is the 
desired contraction. The same argument also shows that length contraction implies 
time dilation. 

4. As mentioned earlier, the length contraction factor 7 is independent of position. That 
is, all parts of the train are contracted by the same amount. This follows from the 
fact that all points in space are equivalent. Equivalently, we could put a number of 
small replicas of the above source-mirror system along the length of the train. They 
would all produce the same value for 7 , independent of the position on the train. £ 
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Example: Two trains, A and B, each have proper length L and move in the same 
direction. A’s speed is 4c/5, and B’s speed is 3c/5. A starts behind B (see Fig. 10.13). 
How long, as viewed by person C on the ground, does it take for A to overtake B1 
By this we mean the time between the front of A passing the back of B, and the back 
of A passing the front of B. 


Solution: Relative to C on the ground, the 7 factors associated with A and B are 
5/3 and 5/4, respectively. Therefore, their lengths in the ground frame are 3T/5 and 
4L/5. While overtaking B, A must travel farther than B, by an excess distance equal 
to the sum of the lengths of the trains, which is 7L/5. The relative speed of the two 
trains (as viewed by C on the ground) is the difference of the speeds, which is c/5. 
The total time is therefore 


t _ 7&/S _ 71 
c/5 c 


( 10 . 11 ) 


2 C 

Figure 10.13 


Example (Muon decay, again) : Consider the “Muon decay” example from Section 
10.2.2. From the muon’s point of view, it lives for a time of T = 2 • 10 -6 seconds, and 
the earth is speeding toward it at v = .99998c. How, then, does the earth (which will 
travel only d= vT ss 600 m before the muon decays) reach the muon? 

Solution: The important point here is that in the muon’s frame, the distance 
to the earth is contracted by a factor 7 ss 160. Therefore, the earth starts only 
50km/160 « 300 m away. Since the earth can travel a distance of 600 m during the 
muon’s lifetime, the earth collides with the muon, with time to spare. 

As stated in the third remark above, time dilation and length contraction are inti¬ 
mately related. We can’t have one without the other. In the earth’s frame, the muon’s 
arrival on the earth is explained by time dilation. In the muon’s frame, it is explained 
by length contraction. 

Observe that for muons, created, 

The dilation of time is related 
To Einstein’s insistence 
Of shrunken-down distance 
In the frame where decays aren’t belated. 


This concludes our treatment of the three fundamental effects. In the next 
section, we’ll combine all the information we’ll gained and use it to derive the Lorentz 
transformations. 

In everything we’ve done so far, we’ve taken the route of having observers sitting 
in various frames, making various measurements. But as mentioned above, this can 
cause some ambiguity, because you might think that the time when someone actually 
sees something is important, whereas what we are generally concerned with is the 
time when something actually happens. 

A way to avoid this ambiguity is to remove the observers and simply fill up space 
with a large rigid lattice of meter sticks and synchronized clocks. Different frames 
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y y' 



Figure 10.14 


are defined by different lattices. All of the meter sticks in a given frame are at rest 
with respect to all the others, so there is no issue of length contraction within each 
frame. To measure the length of something, we simply need to determine where 
the ends are (at simultaneous times, as measured in that frame) with respect to the 
lattice. 

As far as the synchronization of the clocks within each frame goes, this can 
be accomplished by putting a light source midway between any two of them and 
sending out a signal, and then setting the clocks to a certain value when the signal 
hits them. Alternatively, a more straightforward method of synchronization is to 
start with all the clocks synchronized right next to each other, and to then move 
them very slowly to their final positions. Any time-dilation effects can be made 
arbitrarily small by moving the clocks sufficiently slowly. 

This lattice way of looking at things emphasizes that observers are not impor¬ 
tant, and that a frame is defined simply as a lattice of space and time coordinates. 
Anything that happens (an “event”) is automatically assigned a space and time 
coordinate in every frame, independent of any observer. The concept of an “event” 
will be very important in the next section. 


10.3 The Lorentz transformations 

10.3.1 The derivation 

Consider a coordinate system, S', moving relative to another system, S (see Fig. 10.14). 
Let the constant relative speed of the frames be v. Let the corresponding axes of S 
and S' point in the same direction, and let the origin of S’ move along the x-axis 
of S, in the positive direction. Nothing exciting happens in the y and 2 directions 
(see Problem 1), so we’ll ignore them. 

Our goal in this section is to look at two events (an event is simply anything 

that has space and time coordinates) in spacetime and relate the Ax and At of 

the coordinates in one frame to the Ax 1 and At 1 of the coordinates in another. We 
therefore want to find the constants A, B, C. and D in the relations, 

Ax = AAx' + BAt', 

At = CAt’ + DAx’. (10.12) 

The four constants here will end up depending on v (which is constant, given the 
two inertial frames). But we will not explicitly write this dependence, for ease of 
notation. 

Remarks: 

1. We have assumed in eq. (10.12) that Ax and At are linear functions of Ax' and At'. 
And we have also assumed that A, B, C, and D are constants (that is, dependent 
only on v, and not on x,t,x',t'). 

The first of these assumptions is justified by the fact that any finite interval can be 
built up from a series of many infinitesimal ones. But for an infinitesimal interval, 
any terms such as, for example, (At') 2 , are negligible compared to the linear terms. 
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Therefore, if we add up all the infinitesimal intervals to obtain a finite one, we will be 
left with only the linear terms. Equivalently, it shouldn’t matter whether we make a 
measurement with, say, meter sticks or half-meter sticks. 

The second assumption can be justified in various ways. One is that all inertial frames 
should agree on what “non-accelerating” motion is. That is, if Ax' = v! At', then 
we should also have Ax = it A t, for some constant it. This is true only if the above 
coefficients are constants. Another justification comes from the second of our two 
relativity postulates, which says that all points in (empty) space are indistinguishable. 
With this in mind, let us assume that we have a transformation of the form, say, 
Ax = A Ax' + B At' + Ex' Ax'. The x' in the last term implies that the absolute 
location in spacetime (and not just the relative position) is important. Therefore, this 
last term cannot exist. 

2. If the relations in eq. (10.12) turned out to be the usual Galilean transformations 
(which are the ones that hold for everyday relative speeds, v) then we would have 
Ax = Ax'+v At, and At = At' (that is, A = C = 1, B = v, and D = 0). We will find, 
however, under the assumptions of Special Relativity, that this is not the case. The 
Galilean transformations are not the correct transformations. But we will show below 
that the correct transformations do indeed reduce to the Galilean transformations in 
the limit of slow speeds, as they must. £ 

The constants A, B, C, and D in eq. (10.12) are four unknowns, and we can 
solve for them by using four facts we found above in Section 10.2. These four facts 
we will use are: 


effect | condition | result | eq. in text 


1 

Time dilation 

x 1 = 0 

t = 

(10.5) 

2 

Length contraction 

t! = 0 

x ' = x /7 

( 10 . 10 ) 

3 

Relative v of frames 

x = 0 

x' = —vtf 


4 

“Head-start” 

t = 0 

t 1 = —vx'/c 2 

( 10 . 2 ) 


We have taken the liberty of dropping the A’s in front of the coordinates, lest 
things get too messy. We will continue to omit the A’s in what follows, but it 
should be understood that x really means Ax, etc. We are always concerned with 
the difference between coordinates of two events in spacetime. The actual value of 
any coordinate is irrelevant, because there is no preferred origin in any frame. 

You should pause for a moment and verify that the four “results” in the above 
table are in fact the proper mathematical expressions for the four effects, given the 
stated “conditions .” 16 My advice is to continue pausing until you are comfortable 
with the conditions for time dilation and length contraction, discussed in various 
remarks above. Note also that the sign in the “Head-start” effect is indeed correct, 
because the front clock shows less time than the rear clock. So the clock with the 
higher x' value is the one with the lower t' value. 

We can now use our four facts in the above table to quickly solve for the un¬ 
knowns A, B, C, and D in eq. (10.12). 

16 Of course, we can state the effects in other ways by switching the primes and unprimes. For 
example, time dilation can be written as “t 1 = 7 1 when x = 0”. But we’ve chosen the above ways 
of writing things because they will allow us to solve for the four unknowns in the quickest way. 






X-16 


CHAPTER 10. RELATIVITY (KINEMATICS) 


Fact (1) gives C = 7. 

Fact (2) gives A = 7. 

Fact (3) gives B/A = v => B = jv. 

Fact (4) gives D/C = v/<? =h D = ^v/c 2 . 

Eqs. (10.12), which are known as the Lorentz transformations , are therefore given 
by 17 

Ax = ^(Ax' + vA t'), 

At = 7( At' + v Ax'/c 2 ), 

Ay = Ay', 

A z = A z', (10.13) 

where 



We have tacked on the trivial transformations for y and z, but we won’t bother 
writing these in the future. 

If we solve for x' and t' in terms of x and t in eq. (10.13), then we see that the 
inverse Lorentz transformations are given by 

x' = 7 (x — vt), 

t' = 7 (t — vx/c 2 ). (10.15) 

Of course, which ones are the “inverse” transformations depends simply on your 
point of view. But it’s intuitively clear that the only difference between the two sets 
of equations is the sign of v, because S is simply moving backwards with respect to 
S'. 

The reason why the derivation of eqs. (10.13) was so quick is that we already 
did most of the work in Section 10.2, when we derived the fundamental effects. If 
we wanted to derive the Lorentz transformations from scratch, that is, by starting 
with the two postulates in Section 10.1, then the derivation would be much longer. 
In Appendix I we give such a derivation, where it is clear what information comes 
from each of the postulates. The procedure there is somewhat cumbersome, but 
it’s worth taking a look at, because we will invoke the results in a very cool way in 
Section 10.8. 

Remarks: 

1. In the limit v c, eqs. (10.13) reduce to x = x' + vt and t = t’, that is, simply 
the Galilean transformations. This must be the case, because we know from everyday 
experience (where ti<c) that the Galilean transformations work just fine. 

17 This will be the last time we’ll write the A’s, but remember that they’re always really there. 
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2. Eqs. (10.13) exhibit a nice symmetry between x and ct. With /3 = v/c, they become 
x = 7 (a:' + /3(cf')), 

ct = + /3x'). (10.16) 


Equivalently, in units where c = 1 (for example, where one unit of distance equals 
3 • 10 8 meters, or where one unit of time equals 1/(3 • 10 8 ) seconds), eqs. (10.13) take 
the symmetric form 

x = 7 ( 3 •!+ vt'), 

t = 7 {tf + vx'). (10.17) 


3. In matrix form, eqs. (10.16) are 



7 7/3 \ ( x’ \ 

7/3 7 ) V ct ' ) ' 


(10.18) 


This looks similar to a rotation matrix. More about this in Section 10.7, and in 
Problem 26. 


4. The plus or minus sign on the right-hand side of the L.T.’s in eqs. (10.13) and 
(10.15) corresponds to which way the coordinate system on the left-hand side sees the 
coordinate system on the right-hand side moving. But if you get confused about the 
sign, simply write down xa = 7 (xb ± vts), and then imagine sitting in system A and 
looking at a fixed point in B, which satisfies (putting the A’s back in to avoid any 
mixup) Ax rj = 0 , which gives Ax a = ± 7 vA t B - If the point moves to the right (that 
is, if it increases as time increases), then pick the “+”. If it moves to the left, then 
pick the In other words, the sign is determined by which way A sees B moving. 

5. One very important thing we must check is that two successive Lorentz transforma¬ 
tions (from Si to S2 and then from S 2 to S3) again yield a Lorentz transformation 
(from Si to S3). This must be true because we showed that any two frames must be 
related by eq. (10.13). If we composed two L.T.’s and found that the transformation 
from Si to S3 was not of the form of eqs. (10.13), for some new v, then the whole 
theory would be inconsistent, and we would have to drop one of our postulates. We’ll 
let you show that the combination of an L.T. (with speed t>i) and an L.T. (with speed 
V 2 ) does indeed yield an L.T. (with speed (vi + V 2 )/(I + v\ V 2 /C 2 ), which we’ll see 
again in the velocity-addition formula in Section 10.3.3). This is the task of Exercise 
14, and also Problem 26 (which is stated in terms of rapidity, introduced in Section 
10.7). * 


Example: A train with proper length L moves with speed 5c/13 with respect to 
the ground. A ball is thrown from the back of the train to the front. The speed of 
the ball with respect to the train is c/3. As viewed by someone on the ground, how 
much time does the ball spend in the air, and how far does it travel? 

Solution: The 7 factor associated with the speed 5c/13 is 7 = 13/12. The two events 
we are concerned with are “ball leaving back of train” and “ball arriving at front of 
train.” The spacetime separation between these events is easy to calculate on the 
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train. We have Ax T = L, and A t T = L/(c/ 3) = 3 L/c. The Lorentz transformations 
giving the coordinates on the ground are 


Therefore, 


x G = 7 (x T +vt T ), 

t G = 7(t T + vx T /c 2 ). 


13/ /5 c\f3L\\ 

XG = r2[ L+ {u){-7j) 


7 L 
3 ’ 


and 


(10.19) 


to 


13 /31/ ffM ILL 
12 \ 7 + ~^) ~ “37 ' 


( 10 . 20 ) 


In a given problem, such as the above example, one of the frames usually allows 
for an easy calculation of Ax and At, so you simply have to mechanically plug these 
quantities into the L.T.’s to obtain Ax’ and At' in the other frame, where they may 
not be as obvious. 

Relativity is a subject where there are usually many ways to do a problem. If 
you are trying to find some Ax’s and Af’s, then you can use the L.T.’s, or perhaps 
the invariant interval (introduced in Section 10.4), or maybe a velocity-addition 
approach (introduced in Section 10.3.3), or even the sending-of-light-signals strategy 
used in Section 10.2. Depending on the specific problem and what your personal 
preferences are, certain approaches will be more enjoyable than others. But no 
matter which method you choose, you should take advantage of the plethora of 
possibilities by picking a second method to double-check your answer. Personally, 
I find the L.T.’s to be the perfect option for this, because the other methods are 
generally more fun when solving a problem for the first time, while the L.T.’s are 
usually quick and easy to apply (perfect for a double-check). 18 

The excitement will build in your voice, 

As you rise from your seat and rejoice, 

“A Lorentz transformation 

Provides confirmation 

Of my alternate method of choice!” 

10.3.2 The fundamental effects 

Let us now see how the Lorentz transformations imply the three fundamental effects 
(namely, loss of simultaneity, time dilation, and length contraction) discussed in Sec¬ 
tion 10.2. Of course, we just used these effects to derive the Lorentz transformation, 
so we know everything will work out. We’ll just be going in circles. But since these 
fundamental effects are, well, fundamental, let’s belabor the point and discuss them 
one more time, with the starting point being the Lorentz transformations. 

18 I would, however, be very wary of solving a problem using only the L.T.’s, with no other check, 
because it’s very easy to mess up a sign in the transformations. And since there’s nothing to do 
except mechanically plug in numbers, there’s not much opportunity for an intuitive check, either. 
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Loss of Simultaneity 

Let two events occur simultaneously in frame S'. Then the separation between 
them, as measured by S', is (x',t') = (x',0). As usual, we are not bothering to write 
the A’s in front of the coordinates. Using the second of eqs. (10.13), we see that 
the time between the events, as measured by S, is t = 7 vx'/c 2 . This is not equal to 
zero (unless x' = 0). Therefore, the events do not occur simultaneously in frame S. 

Time dilation 

Consider two events that occur in the same place in S'. Then the separation between 
them is (x',t') = (0 ,t'). Using the second of eqs. (10.13), we see that the time 
between the events, as measured by S, is 

t = jtf (if x! — 0). (10.21) 

The factor 7 is greater than or equal to 1, so t > t'. The passing of one second on 
S' 's clock takes more than one second on S’s clock. S sees S' drinking his coffee 
very slowly. 

The same strategy works if we interchange S and S'. Consider two events that 
occur in the same place in S. The separation between them is (x, t) = (0, t). Using 
the second of eqs. (10.15), we see that the time between the events, as measured by 
S', is 

t 1 = jt (if x = 0 ). ( 10 . 22 ) 

Therefore, t' > t. Another way to derive this is to use the first of eqs. (10.13) to 
write x 1 = —vt', and then substitute this into the second equation. 

Remark: If we write down the two above equations by themselves, t = 'yt' and t! = 'yt, 
they appear to contradict each other. This apparent contradiction arises from the omission 
of the conditions they are based on. The former equation is based on the assumption that 
x' = 0. The latter equation is based on the assumption that x = 0. They have nothing to 
do with each other. It would perhaps be better to write the equations as 

(t = 7i')x'=o, 

(t' = 7 f) x=0 , (10.23) 

but this is somewhat cumbersome. £ 

Length contraction 

This proceeds just like the time dilation above, except that now we want to set 
certain time intervals equal to zero, instead of certain space intervals. We want to 
do this because to measure a length, we simply measure the distance between two 
points whose positions are measured simultaneously. That’s what a length is. 

Consider a stick at rest in S', where it has length £'. We want to find the length 
I in S. Simultaneous measurements of the coordinates of the ends of the stick in S 
yield a separation of ( x,t ) = (x, 0). Using the first of eqs. (10.15), we have 

x' = 'yx (if t = 0 ). 


(10.24) 
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But x is by definition the length in S. And x' is the length in S', because the stick 
is not moving in S'. 19 Therefore, I = £'/j. Since 7 > 1, we have i < £', so S sees 
the stick shorter than S' sees it. Another way to derive eq. (10.24) is to use the 
second of eqs. (10.13) to write t' = —(v/c 2 )x'. and then substitute this into the first 
equation. 

Now interchange S and S'. Consider a stick at rest in S, where it has length £. 
We want to find the length in S'. Measurements of the coordinates of the ends of 
the stick in S' yield a separation of (x',t') = (x',0). Using the first of eqs. (10.13), 
we have 

x = ^x' (if// -0). (10.25) 

But x' is by definition the length in S'. And x is the length in S, because the stick 
is not moving in S. Therefore, £' = I/7, so £! < £. 

Remark: As with time dilation, if we write down the two above equations by them¬ 
selves, l = £'/'y and £! = t'/y they appear to contradict each other. But as before, this 
apparent contradiction arises from the omission of the conditions they are based on. The 
former equation is based on the assumptions that t = 0 and that the stick is at rest in S'. 
The latter equation is based on the assumptions that t' = 0 and that the stick is at rest in 
S. They have nothing to do with each other. We should really write, 

(x = 

(x 1 = a;/7) t / =0 , (10.26) 

and then identify x' in the first equation with £! only after invoking the further assumption 
that the stick is at rest in S'; likewise for the second equation. But this is a pain. £ 



S 

Figure 10.15 


10.3.3 Velocity addition 
Longitudinal velocity addition 

Consider the following setup. An object moves at speed v\ with respect to frame 
S'. And frame S' moves at speed iy with respect to frame S, in the same direction 
as the motion of the object; see Fig. 10.15. What is the speed, it, of the object with 
respect to frame S’? 

The Lorentz transformations may be used to easily answer this question. The 
relative speed of the frames is iy. Consider two events along the object’s path (for 
example, say it makes some beeps). We are given that Ax'/At' = v\. Our goal is 
to find u = Ax/At. 

The Lorentz transformations from S' to S, eqs. (10.13), are 


Ax = %{Ax' + iyAt'), and At = 72 (At 7 + iyAa//c 2 ), (10.27) 


19 The measurements of the ends made by S will not be simultaneous in the S' frame. In the S' 
frame, the separation between the events is (x', t'), where both x' and t' are nonzero. This does not 
satisfy our definition of a length measurement in S' (because t' ^ 0), but the stick is not moving in 
S', so S' can measure the ends whenever he feels like it, and he will always get the same difference. 
So x' is indeed the length in the S' frame. 
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where 72 = — v^/c 2 . Therefore, 

_ Ax _ Ax' + v-zAt' 

U At At' + V 2 A x'/c 2 

Ax' / At' + V 2 
1 + u 2 (Ax'/A t')/c 2 

V\ + V2 
1 + Vl v 2 /c 2 ' 


(10.28) 


This is the velocity-addition formula (for adding velocities in the same direction). 
Let’s look at some of its properties. (1) It is symmetric with respect to v\ and i> 2 , 
as it should be, because we could switch the roles of the object and frame S. (2) 
For V 1 V 2 <C c 2 , it reduces to u ~ v\ + u 2 , which we know holds perfectly well for 
everyday speeds. (3) If iq = c or u 2 = c, then we find u = c, as should be the 
case, because anything that moves with speed c in one frame moves with speed c in 
another. (4) The maximum (or minimum) of u in the region — c < v\, u 2 < c equals 
c (or —c), which can be seen by noting that du/dv 1 and du/dv 2 are never zero in 
the interior of the region. 

If you take any two velocities that are less than c, and add them according to eq. 
(10.28), then you will obtain a velocity that is again less than c. This shows that 
no matter how much you keep accelerating an object (that is, no matter how many 
times you give the object a speed v\ with respect to the frame moving at speed i; 2 
that it was just in), you can’t bring the speed up to the speed of light. We’ll give 
another argument for this result in Chapter 11 when we discuss energy. 

Remark: Consider the two scenarios shown in Fig. 10.16. If the goal is to find the 
speed of A with respect to C, then the velocity-addition formula applies to both scenarios, 
because the second scenario is simply the first one, as observed in B’s frame. 

The velocity-addition formula applies when we ask, “If A moves at v-\ with respect to 
B, and B moves at u 2 with respect to C (which means, of course, that C moves at speed 
v <2 with respect to B), then how fast does A move with respect to C?” The formula does 
not apply if we ask the more mundane question, “What is the relative speed of A and C, as 
viewed by B?” The answer to this is simply V\ + u 2 . 

In short, if the two speeds are given with respect to B, and if you are asking for the 
relative speed as measured by B, then you simply add the speeds. 20 But if you are asking 
for the relative speed as measured by A or C, then you have to use the velocity-addition 
formula. 4k 



C 


A V ‘> 


B 


Figure 10.16 


Example: Consider again the scenario in the first example in Section 10.2.3. 

(a) How long, as viewed by A and as viewed by B, does it take for A to overtake 
B? 

20 Note that the resulting speed can certainly be greater than c. If I see a ball heading toward 
me at .9c from the right, and another one heading toward me at .9c from the left, then the relative 
speed of the balls in my frame is 1.8c. In the frame of one of the balls, however, the relative speed 
is (1.8/1.81)c « (,9945)c, from eq. (10.28). 
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Figure 10.17 



Figure 10.18 


(b) Let event E\ be “the front of A passing the back of B ”, and let event E 2 be “the 
back of A passing the front of B”. Person D walks at constant speed from the 
back of train B to its front (see Fig. 10.17), such that he coincides with both 
events E-\ and E 2 . How long does the “overtaking” process take, as viewed by 
D? 


Solution: 


(a) First consider B ’s point of view. From the velocity-addition formula, B s 
move with speed 


1 - 


13 ' 


(10.29) 


The 7 factor associated with this speed is 7 = 13/12. Therefore, B sees A’s 
train contracted to a length 12L/13. During the overtaking, A must travel a 
distance equal to the sum of the lengths of the trains in B's frame, which is 
L + 12L/13 = 25L/13. Since A moves at speed 5c/13, the total time in B ’s 
frame is 

25L/13 5 L 

<10 ' 30) 


The exact same reasoning holds from A’s point of view, so we have t A = ts = 
5 L/c. 

(b) Look at things from D ’s point of view. D is at rest, and the two trains move with 
equal and opposite speeds, v, as shown in Fig. 10.18. The relativistic addition 
of v with itself must equal the 5c/13 speed we found in part (a). Therefore, 


2v _ 5c 
1 + v 2 /c 2 ~ 13 


(10.31) 


The 7 factor associated with this speed is 7 = 5/(2\/6). Therefore, D sees both 
trains contracted to a length 2\/6L/5. During the overtaking, each train must 
travel a distance equal to its length, because both events, E\ and E 2 , take place 
right at D. The total time in D’s frame is therefore 


2V6L/5 _ 2V6L 
c/5 c 


(10.32) 


Remarks: There are a few double-checks we can perform. The speed of D with 
respect to the ground can be obtained by either relativistically adding 3c/5 and c/5, 
or subtracting c/5 from 4c/5. These both give the same answer, namely 5c/7, as they 
must. (The c/5 speed can in fact be determined by this reasoning, instead of using 
eq. (10.31).) The 7 factor between the ground and D is therefore 7/2^6. We can now 
use time dilation to say that someone on the ground sees the overtaking take a time 
of (7/2-\/6)fr>. (We can say this because both events happen right at D.) Using eq. 
(10.32), this gives a ground-frame time of 7L/c, in agreement with eq. (10.11). 
Likewise, the 7 factor between D and either train is 5/2-\/6. So the time of the 
overtaking as viewed by either A or B is (5/2\/6)fz3 = 5 L/c, in agreement with eq. 
(10.30). 

Note that we cannot use simple time dilation to relate the ground to A or B, because 
the two events don’t happen at the same place in the train frames. But since both 
events happen at the same place in D’s frame, namely right at D, we can indeed use 
time dilation to go from D’s frame to any other frame. £ 
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Transverse velocity addition 


Consider the following general two-dimensional situation. An object moves with 
velocity (u' x ,u' y ) with respect to frame S'. And frame S' moves with speed v with 
respect to frame S, in the x-direction (see Fig. 10.19). What is the velocity, (u x , u y ). 
of the object with respect to frame SI 

The existence of motion in the [{/-direction doesn’t affect the preceding derivation 
of the speed in the x-direction, so eq. (10.28) is still valid. In the present notation, 
it becomes 


u x 



1 + u’ x v/c 2 ■ 


(10.33) 


To find u y , we may again make easy use of the Lorentz transformations. Consider 
two events along the object’s path. We are given that Ax'/Aif = u' x , and Ay'/At 1 = 
u'y. Our goal is to find u y = Ay/At. The relevant Lorentz transformations from S' 
to S in eqs. (10.13) are 


Ay = Ay', and At = q(At' + vAx'/c 2 ). (10.34) 


Therefore, 


= 

Uy ~ At 


Remark: In the special case where u' x = 0, we have u y = u' y / 7. When u' y is small and 
v is large, this result can be seen to be a special case of time dilation, in the following way. 
Consider a series of equally spaced lines parallel to the x-axis (see Fig. 10.20). Imagine 
that the object’s clock ticks once every time it crosses a line. Since u' y is small, the object’s 
frame is essentially frame S’, so the object is essentially moving at speed v with respect to 
S. Therefore, S sees the clock run slow by a factor 7. This means that S sees the object 
cross the lines at a slower rate, by a factor 7 (because the clock still ticks once every time 
it crosses a line; this is a frame-independent statement). Since distances in the {/-direction 
are the same in the two frames, we conclude that u y = u' y / 7. This 7 factor will be very 
important when we deal with momentum in Chapter 11. 

To sum up: if you run in the x-direction past an object, then its {/-speed slows down 
from your point of view (or speeds up, depending on the relative sign of u' x and v). Strange 
indeed, but no stranger than other effects we’ve seen. Problem 15 deals with the special 
case where u' x = 0, but where u' y is not necessarily small. £ 


A y' 

7(A t' + vAx'/c 2 ) 

Ay'/At' 

7(1 + v(Ax'/A t')/c 2 ) 
7(1 + u' x v/c 2 ) 


(10.35) 


10.4 The invariant interval 

Consider the quantity, 

(As) 2 = c 2 (At) 2 - (Ax) 2 . (10.36) 



u'x S' 


S 

Figure 10.19 


S' 

l 





S 

Figure 10.20 
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Technically, we should also subtract off (Ay) 2 and (Az) 2 . but nothing exciting 
happens in the transverse directions, so we’ll ignore these. Using eq. (10.13), we 
can write (As) 2 in terms of the S' coordinates, Ax' and At'. The result is (dropping 
the A’s) 

2 2 2 _ C 2 (t' + VX'/C 2 ) 2 (x' + vt') 2 

Ct ~ X ~ 1 - V 2/ c 2 1 - V 2 /C 2 

t' 2 (c 2 - V 2 ) - X ,2 (l - v 2 /c 2 ) 

1 - v 2 /c 2 
= c 2 t' 2 -x' 2 

= s' 2 . (10.37) 


We see that the Lorentz transformations imply that the quantity c 2 t 2 — x 2 does 
not depend on the frame. This result is more than we bargained for, for the following 
reason. The speed-of-light postulate says that if c 2 t' 2 — x' 2 = 0, then c 2 t 2 — x 2 = 0. 
But eq. (10.37) says that if c 2 t ,2 —x' 2 = b , then c 2 t 2 — x 2 = b, for any b, not just zero. 
This is, as you might guess, very useful. There are enough things that change when 
we go from one frame to another, so it’s nice to have a frame-independent quantity 
that we can hang on to. The fact that s 2 is invariant under Lorentz transformations 
of x and t is exactly analogous to the fact that r 2 is invariant under rotations in the 
x-y plane. The coordinates themselves change under the transformation, but the 
special combination of c 2 t 2 — x 2 for Lorentz transformations, of x 2 + y 2 for rotations, 
remains the same. All inertial observers agree on the value of s 2 , independent of 
what they measure for the actual coordinates. 


“Potato?! Potahto!” said she, 

“And of course it’s toma/ito, you see. 

But the square of ct 
Minus x 2 will be 

Always something on which we agree.” 

A note on terminology: The separation in the coordinates, (cAt, Ax), is usually 
referred to as the spacetime interval, while the quantity (As) 2 = c 2 (At) 2 — (Ax) 2 
is referred to as the invariant interval (or technically the square of the invariant 
interval). At any rate, just call it s 2 , and people will know what you mean. 

The invariance of s 2 is actually just a special case of more general results involv¬ 
ing inner products and 4-vectors, which we’ll discuss in Chapter 12. Let’s now look 
at the physical significance of s 2 = c 2 t 2 — x 2 . There are three cases to consider. 


Case 1: s 2 > 0 (timelike separation) 

In this case, we say that the two events are timelike separated. We have c 2 t 2 > x 2 , 
and so \x/t\ < c. Consider a frame S' moving at speed v with respect to S. The 
Lorentz transformation for x is 


x' = q(x — vt). 


(10.38) 
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Since \x/t\ < c, there exists a v which is less than c (namely v = x/t) that makes 
x' = 0. In other words, if two events are timelike separated, it is possible to find a 
frame S' in which the two events happen at the same place. In short, the condition 
\x/t\ < c means that it is possible for a particle to travel from one event to the 
other. The invariance of s 2 then gives s 2 = c 2 t' 2 — x' 2 = c 2 t' 2 . So we see that s/c 
is simply the time between the events in the frame in which the events occur at the 
same place. This time is called the proper time. 

Case 2: s 2 < 0 (spacelike separation) 

In this case, we say that the two events are spacelike separated. 21 We have c 2 t 2 < x 2 , 
and so \t/x\ <l/c. Consider a frame S' moving at speed v with respect to S. The 
Lorentz transformation for t' is 

t' = 7(i — vx/c 2 ). (10.39) 

Since \t/x\ < 1/c, there exists a v which is less than c (namely v = c 2 t/x) that 
makes t' = 0. In other words, if two events are spacelike separated, it is possible to 
find a frame S' in which the two events happen at the same time. (This statement 
is not as easy to see as the corresponding one in the timelike case above. But if you 
draw a Minkowski diagram, described in the next section, it becomes clear.) The 
invariance of s 2 then gives s 2 = c 2 t' 2 — x' 2 = —x' 2 . So we see that |s| is simply 
the distance between the events in the frame in which the events occur at the same 
time. This distance is called the proper distance. 

Case 3: s 2 = 0 (lightlike separation) 

In this case, we say that the two events are lightlike separated. We have c 2 t 2 = x 2 . 
and so \x/t\ = c. This holds in every frame, so in every frame a photon emitted at 
one of the events will arrive at the other. It is not possible to find a frame S' in 
which the two events happen at the same place or the same time, because the frame 
would have to travel at the speed of light. 


Example (Time dilation): An illustration of the usefulness of the invariance of s 2 
is a derivation of time dilation. Let frame S' move at speed v with respect to frame S. 
Consider two events at the origin of S', separated by time t'. The separation between 
the events is 


in S' : {x’. l') = (0,f'), 
in S : (x,t) = (vt,t). 

The invariance of s 2 implies c 2 t' 2 — 0 = c 2 t 2 — v 2 t 2 . Therefore, 


t = 


t' 

yi-^/c 2 ' 


(10.40) 


(10.41) 


Here it is clear that this time-dilation result rests on the assumption that x 1 = 0. 

21 It’s fine that s 2 is negative in this case, which means that s is imaginary. We can simply take 
the absolute value of s if we want to obtain a real number. 






X-26 


CHAPTER 10. RELATIVITY (KINEMATICS) 


Figure 10.21 


Example: Consider again the scenario in the examples in Sections 10.2.3 and 10.3.3. 
Verify that the s 2 between the events E\ and E 2 is the same in all of the frames, A, 
B, C, and D (see Fig. 10.21). 

Solution: The only quantity that we’ll need that we haven’t already found in the two 
examples above is the distance between Ei and E 2 in C’s frame (the ground frame). 
In this frame, train A travels at a rate 4c/5 for a time tc = 7L/c, covering a distance 
of 28L/5. But event E 2 occurs at the back of the train, which is a distance 3L/5 
behind the front end (this is the contracted length in the ground frame). Therefore, 
the distance between events E\ and E 2 in the ground frame is 28L/5 — 3L/5 = 5 L. 
You can also apply the same line of reasoning using train B, in which the 5 L takes 
the form (3c/5)(7L/c) +4L/5. 

Putting all of the previous results together, we have the following separations between 
events: 



A 

B 

C 

D 

At 

5 L/c 

5L/c 

7L/c 

2y/6L/c 

Ax 

-L 

L 

5 L 

0 


From the table, we see that As 2 = c 2 At 2 — Ax' 2 = 24 L 2 for all four frames, as desired. 
We could have, of course, worked backwards and used the s 2 = 24 L 2 result from 
frames A, B, or D, to deduce that Ax = 5 L in frame C. 

In Problem 11, you are asked to perform the mundane task of checking that the values 
in the above table satisfy the Lorentz transformations between the various pairs of 
frames. 


10.5 Minkowski diagrams 

Minkowski diagrams (or “spacetime” diagrams) are extremely useful in seeing how 
coordinates transform between different reference frames. If you want to produce 
exact numbers in a problem, you will probably have to use one of the strategies 
we’ve encountered so far. But as far as getting an overall intuitive picture in a 
problem goes (if there is in fact any such thing as intuition in relativity), there is 
no better tool than a Minkowski diagram. Here’s how you make one. 

Let frame S' move at speed v with respect to frame S (along the x-axis, as usual; 
ignore the y and z components). Draw the x and ct axes of frame S. 22 What do 
the x' and ct 1 axes of S' look like, superimposed on this diagram? That is, at what 
angles are the axes inclined, and what is the size of one unit on these axes? (There 
is no reason why one unit on the x' and ct 1 axes should have the same length on the 
paper as one unit on the x and ct axes.) We can answer these questions by using 
the Lorentz transformations, eqs. (10.13). We’ll first look at the ct' axis, and then 
the x' axis. 

22 We choose to plot ct instead of t on the vertical axis, so that the trajectory of a light beam lies 
at a nice 45° angle. Alternatively, we could choose units where c = 1. 
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ct '-axis angle and unit size 

Look at the point ( x',ct ') = (0,1), which lies on the ct' axis, one ct! unit from 
the origin (see Fig. 10.22). Eqs. (10.13) tell us that this point is the point 
(. x,ct ) = (yu/c, 7 ). The angle between the ct ' and ct axes is therefore given by 
tan = x/ct = v/c. With /3 = v/c, we have 

tan#! = (3. (10.42) 



N(x',cO=(l,0) 

Mx,c?)=(y,Py) 


Alternatively, the ct' axis is simply the “worldline” of the origin of S'. (A worldline 
is simply the path an object takes as it travels through spacetime.) The origin moves 
at speed v with respect to S. Therefore, points on the ct' axis satisfy x/t = v, or 
x/ct = v/c. 

On the paper, the point ( x',ct ') = (0,1), which we just found to be the point 
(. x,ct ) = ('yv/c, 7 ), is a distance y^/l + v 2 / e 2 from the origin. Using the definitions 
of (3 and 7 , we see that 


one ct' unit 
one ct unit 



(10.43) 


as measured on a grid where the x and ct axes are orthogonal. Note that this ratio 
approaches infinity as (3 —> 1. And it of course equals 1 if f3 = 0. 


Figure 10.22 


x'-axis angle and unit size 

The same basic argument holds here. Look at the point ( x',ct ') = (1,0), which lies 
on the x' axis, one x' unit from the origin (see Fig. 10.22). Eqs. (10.13) tell us that 
this point is the point ( x,ct ) = ( 7,7 v/c). The angle between the x' and x axes is 
therefore given by tan #2 = ct/x = v/c. So, as in the ct'-axis case, 

tan 0 2 = (3. (10.44) 


On the paper, the point ( x',ct ') = (1,0), which we just found to be the point 
(x,ct) = ( 7,7 v/c), is a distance 71 /I + v l /(? from the origin. So, as in the ct!- axis 


one x' unit 
one x unit 



(10.45) 


as measured on a grid where the x and ct axes are orthogonal. Both the x' and 
ct' axes are therefore stretched by the same factor, and tilted by the same angle, 
relative to the x and ct axes. 


Remarks: If v/c = (3 = 0, then Q 1 = 0 2 = 0, so the ct' and x' axes coincide with the 
ct and x axes, as they should. If (3 is very close to 1, then the x' and ct' axes are both 
very close to the 45° light-ray line. Note that since 6\= 6 2 , the light-ray line bisects the x' 
and ct' axes. Therefore (as we verified above), the scales on these axes must be the same, 
because a light ray must satisfy x' = ct'. X 

We now know what the x' and ct' axes look like. Given any two points in a 
Minkowski diagram (that is, given any two events in spacetime), we can simply read 
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left 



Figure 10.23 


off (if our graph is accurate enough) the Ax, Act, Ax', and Act' quantities that our 
two observers would measure. These quantities must of course be related by the 
Lorentz transformations, but the advantage of a Minkowski diagram is that you can 
actually see geometrically what’s going on. 

There are very useful physical interpretations of the ct 1 and x' axes. If you stand 
at the origin of S 1 , then the ct' axis is the “here” axis, and the x' axis is the “now” 
axis. That is, all events on the ct' axis take place at your position (the ctf axis is 
your worldline, after all), and all events on the x' axis take place simultaneously 
(they all have t' = 0 ). 


Example (Length contraction) : For both parts of this problem, use a Minkowski 
diagram where the axes in frame S are orthogonal. 

(a) The relative speed of S' and S is v (along the x direction). A 1-meter stick (as 
measured by S') lies along the x' axis and is at rest in S'. S measures its length. 
What is the result? 

(b) Do the same problem, except with S and S' interchanged. 

Solution: 

(a) Without loss of generality, pick the left end of the stick to be at the origin in 
S'. Then the worldlines of the two ends are shown in Fig. 10.23. The distance 
AC is 1 meter in S n s frame (because A and C are the endpoints of the stick 
at simultaneous times in the S' frame; this is how a length is measured). And 
since one unit on the x' axis has length \J \ + d 2 /- /3 2 , this is the length on 
the paper of the segment AC. 

How does S measure the length of the stick? He simply writes down the x 
coordinates of the ends at simultaneous times (as measured by him, of course), 
and takes the difference. Let the time he makes the measurements be t = 0. 
Then he measures the ends to be at the points A and B. 23 
Now it’s time to do some geometry. We have to find the length of segment AB 
in Fig. 10.23, given that segment AC has length — ft 1 . We know 

that the primed axes are tilted at an angle #, where tan# = f3. Therefore, CD = 
(AC) sin 6 . And since /BCD = 6 , we have BD = (CD) tan 9 = (AC) sin 9 tan 9. 
Therefore (using tan# = (3), 

AB = 


23 If S measures the ends in a dramatic fashion by, say, blowing them up, then S' will see the 
right end blow up first (the event at B has a negative t' coordinate, because it lies below the x' 
axis), and then a little while later S' will see the left end blow up (the event at A has t' = 0). So 
S measures the ends at different times in S''s frame. This is part of the reason why S' should not 
be at all surprised that S’s measurement is smaller than lm. 


AD-BD 

(AC) cos # — (AC) sin # tan # 
(AC) cos #(1 — tan 2 #) 
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Therefore, S measures the meter stick to have length y/l — JS^ which is the 
standard length-contraction result. 

(b) The stick is now at rest in S, and we want to find the length that S' measures. 
Pick the left end of the stick to be at the origin in S. Then the worldlines of the 
two ends are shown in Fig. 10.24. The distance AB is 1 meter in 5”s frame. 

In measuring the length of the stick, S' writes down the x' coordinates of the 
ends at simultaneous times (as measured by him), and takes the difference. Let 
the time he makes the measurements be t' = 0. Then he measures the ends to 
be at the points A and E. 

Now we do the geo metry, which is easy in this case. The length of AE is 
simply 1/cos# = \Jl + fP. But since one unit along the x/ axis has length 
y/l + fPJy/l — P 2 °n the paper, we see that AE is yfl — (P of one unit in S n s 
frame. Therefore, S' measures the meter stick to have length *j$T—pPjs which 
again is the standard length-contraction result. 


The analysis used in the above example also works, of course, for time intervals. 
The derivation of time dilation, using a Minkowski diagram, is the task of Exercise 
28. 

10.6 The Doppler effect 

10.6.1 Longitudinal Doppler effect 

Consider a source that emits flashes at frequency f (in its own frame), while moving 
directly toward you at speed v, as shown in Fig. 10.25. With what frequency do 
the flashes hit your eye? 

In these Doppler-effect problems, you must be careful to distinguish between 
the time at which an event occurs in your frame, and the time at which you see the 
event occur. This is one of the few situations where we will be concerned with the 
latter. 

There are two effects contributing to the longitudinal Doppler effect. The first is 
relativistic time dilation; the flashes occur at a smaller frequency in your frame, that 
is, there is more time between them. The second is the everyday Doppler effect (as 
with sound), arising from the motion of the source; successive flashes have a smaller 
distance (or larger, if v is negative) to travel to reach your eye. This effect increases 
the frequency at which the flashes hit your eye (or decreases it, if v is negative). 

Let’s now be quantitative and find the observed frequency. The time between 
emissions in the source’s frame is At' = 1 //'. The time between emissions in your 
frame is At = "{At/. by the usual time dilation. So the photons of one flash have 
traveled a distance (in your frame) of cAt = c"/At' by the time the next flash occurs. 
During this time between emissions, the source has traveled a distance vAt = v"{At/ 
toward you in your frame. Hence, at the instant the next flash occurs, the photons 
of this next flash are a distance (in your frame) of cAt — vAt = (c — v)"/At/ behind 
the photons of the previous flash. This result holds for all adjacent flashes. The 


left 
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Figure 10.24 
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time, AT, between the arrivals of the flashes at your eye is 1/c times this distance. 
Therefore, 


(10.47) 


where (3 = v/c. Hence, the frequency you see is 


1 

" AT “ 



(10.48) 


If P > 0 (that is, the source is moving toward you), then /> ./'. The everyday 
Doppler effect wins out over the time-dilation effect. In this case we say that the 
light is “blue-shifted,” because blue light is at the high-frequency end of the visible 
spectrum. The light need not have anything to do with the color blue, of course; 
by “blue” we simply mean that the frequency is increased. If f3 < 0 (that is, the 
source is moving away from you), then f < f. Both effects serve to decrease the 
frequency. In this case we say that the light is “red-shifted,” because red light is at 
the low-frequency end of the visible spectrum. 


10.6.2 Transverse Doppler effect 

Consider a source that emits flashes at frequency f (in its own frame), while moving 
across your field of vision at speed v. There are two reasonable questions we may 
ask about the frequency you observe: 


• Case 1: 

At the instant the source is at its closest approach to you, with what frequency 
do the flashes hit your eye? 


- - - - * 

i: 

\ 

case 1 


* 


Figure 10.26 

y 

case 2 J 


• Case 2: 

When you see the source at its closest approach to you, with what frequency 
do the flashes hit your eye? 

The difference between these two scenarios is shown in Fig. 10.26 and Fig. 10.27, 
where the source’s motion is taken to be parallel to the x-axis. 

In the first case, the photons you see must have been emitted at an earlier time, 
because the source will have moved during the non-zero time it takes the light to 
reach you. In this scenario, we are dealing with photons that hit your eye when the 
source crosses the y-axis. You will therefore see the photon come in at an angle with 
respect to the y-axis. 

In the second case, you will see the photons come in along the y-axis (by the 
definition of this scenario). At the instant you observe such a photon, the source 
will be at a position past the y-axis. 

Let’s now find the observed frequencies in these two scenarios. 


Figure 10.27 
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Case 1 

Let your frame be S, and let the source’s frame be S'. Consider the situation from 
S n s point of view. S' sees you moving across his field of vision at speed v. The 
relevant photons hit your eye when you cross the y '-axis of S n s frame. Because of 
time dilation, your clock ticks slowly (in S n s frame) by a factor 7 . 

Now, S' sees you get hit by a flash every At' = 1 jf seconds in his frame. 
(This is true because when you are very close to the y'-axis, all points on your path 
are essentially equidistant from the source. So we don’t have to worry about any 
longitudinal effects.) This means that you get hit by a flash every AT = At'/j = 
1 /(/' 7 ) seconds in your frame. Therefore, the frequency in your frame is 


(10.49) 


Hence, / is greater than f: you see the flashes at a higher frequency than S' emits 
them. 


Case 2 

Again, let your frame be S, and let the source’s frame be S'. Consider the situation 
from your point of view. Because of the time dilation, S"’s clock ticks slowly (in 
your frame) by a factor of 7 . When you see the source cross the y-axis, you therefore 
observe a frequency of 


(10 ' 50) 

We have used the fact the relevant photons are emitted from points that are essen¬ 
tially equidistant from you. So they all travel the same distance, and we don’t have 
to worry about any longitudinal effects. 

We see that / is smaller than f ; you see the flashes at a lower frequency than 
S' emits them. 

Remarks: 

1. When people talk about the “transverse Doppler effect”, they sometimes mean Case 
1, and they sometimes mean Case 2. The title “transverse Doppler” is therefore 
ambiguous, so you should remember to state exactly which scenario you are talking 
about. 

2. The two scenarios may alternatively be described, respectively (as you can convince 
yourself), in the following ways (see Fig. 10.28). 

• Case 1: 

A receiver moves with speed v in a circle around a source. What frequency does 
the receiver register? 

• Case 2: 

A source moves with speed v in a circle around a receiver. What frequency does 
the receiver register? 



Figure 10.28 
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These setups involve accelerating objects. We must therefore invoke the fact (which 
has plenty of experimental verification) that if an inertial observer observes a moving 
clock, then only the instantaneous speed of the clock is important in computing the 
time dilation factor. The acceleration is irrelevant . 24 

3. Beware of the following incorrect reasoning for Case 1, leading to an incorrect version 
of eq. (10.49). “S sees things in S' slowed down by a factor 7 (that is, A t = 7 A t 1 ), 
by the usual time dilation effect. Hence, S sees the light flashing at a slower pace. 
Therefore, / = /'/ 7 .” This reasoning puts the 7 in the wrong place. Where is the 
error? The error lies in confusing the time at which an event occurs in S ’s frame, with 
the time at which S sees (with his eyes) the event occur. The flashes certainly occur 
at a lower frequency in S, but due to the motion of S' relative to S, it turns out that 
the pulses meet S’s eye at a faster rate (because the source is moving slightly towards 
S while it is emitting the relevant photons). We’ll let you work out the details of the 
situation from S ’s point of view . 25 

Alternatively, the error can be stated as follows. The time dilation result, At = yAt', 
rests on the assumption that the Ax' between the two events is zero. This applies 
fine to two emissions of light from the source. However, the two events in question are 
the absorption of two light pulses by your eye (which is moving in S'), so At = 7A t' 
is not applicable. Instead, At' = jAt is the relevant result, valid when Ax = 0. 

4. Other cases that are “inbetween” the longitudinal and transverse cases can also be 
considered. But they get a little messy. 4 


10.7 Rapidity 

Definition 

Let us define the rapidity, 0, by 


tsmh0 = 0= V . (10.51) 

c 

A few properties of the hyperbolic trig functions are given in Appendix A. In par¬ 
ticular, tanh (j> = ( e $ — e~^)/(e^ + e~ ^). The rapidity, <j), defined in eq. (10.51) is 
very useful in relativity because many of our expressions take on a particularly nice 
form when written in terms of it. 

Consider, for example, the velocity-addition formula. Let 0i = tanh 0\ and 
(3 2 = tanh 02 - Then if we add 0\ and 02 using the velocity-addition formula, eq. 
(10.28), we obtain 


01 + 02 
1 + 0102 


tanh 0 1 + tanh r/) 2 
1 + tanh <0\ tanh 02 


= tanh(</>i + 02 ), 


(10.52) 


where we have used the addition formula for tanh 0 (which you can prove by writing 
things in terms of the exponentials, e ±(l> ). Therefore, while the velocities add in the 
strange manner of eq. (10.28), the rapidities add by standard addition. 

24 The acceleration is, however, very important if things are considered from an accelerating 
object’s point of view. But we’ll wait until Chapter 13 on General Relativity to talk about this. 

25 This is a fun exercise (Exercise 31), but it should convince you that it is much easier to look 
at things in the frame in which there are no longitudinal effects, as we did in our solutions above. 
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The Lorentz transformations also take a nice form when written in terms of the 
rapidity. Our friendly 7 factor can be written as 


/l \J 1 — tanh 2 (f> 

P _ tanh 4 > 


= cosh </>. 


(10.53) 


7/3 = . = , = sinh <f). 

/^ \Jl — tanh 2 (/> 


(10.54) 


Therefore, the Lorentz transformations in matrix form, eqs. (10.18), become 


( x \ _ ( cosh (j) sinh <p \ ( x' \ 
ct ) l sinh (j) cosh <p J l ct' ) ' 


(10.55) 


This transformation looks similar to a rotation in a plane, which is given by 


( x \ _ ( cos 9 —sin# \ / x' \ 
\y J Isin# cos 0 J l y' J ’ 


(10.56) 


except that we now have hyperbolic trig functions instead of trig functions. The 
fact that the interval s 2 = c 2 t 2 — x 2 does not depend on the frame is clear from eq. 
(10.55); the cross terms in the squares cancel, and cosh 2 </> — sinh 2 (f> = 1 . (Compare 
with the invariance of r 2 = x 2 + y 2 for rotations in a plane.) 

The quantities associated with a Minkowski diagram also take a nice form when 
written in terms of the rapidity. The angle between the S and S' axes satisfies 


tan# = (3 = tanh(/>. 

And the size of one unit on the x' or ct' axes is, from eq. 

/1 + tanh 2 p / 7 "T“ . , 2 , 

= 4 /-= v cosh (j> + smh (j> = 

V 1 — tanh (p 

For large <f>, this is approximately equal to e^/y/2. 



(10.43), 


(10.57) 


(10.58) 


Physical meaning 

The fact that the rapidity, (j), makes many of our formulas look nice and pretty is 
reason enough to consider it. But in addition, it turns out to have a very meaningful 
physical interpretation. 

Consider the following setup. A spaceship is initially at rest in the lab frame. 
At a given instant, it starts to accelerate. Let a be the proper acceleration, which 
is defined as follows. Let t be the time coordinate in the spaceship’s frame . 26 If the 

26 This frame is of course changing as time goes by, because the spaceship is accelerating. The 
time t is simply the spaceship’s proper time. Normally, we would denote this by t', but we don’t 
want to have to keep writing the primes over and over in the following calculation. 
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proper acceleration is a, then at time t + dt, the spaceship is moving at speed a dt 
relative to the frame it was in at time t. Equivalently, the astronaut feels a force of 
ma applied to his body by the spaceship. If he is standing on a scale, then the scale 
shows a reading of F = ma. 

What is the relative speed of the spaceship and the lab frame at (the spaceship’s) 
time i? 

We can answer this question by considering two nearby times and using the 
velocity-addition formula, eq. (10.28). From the definition of a, eq. (10.28) gives 
(with v\ = adt and V 2 = v(t)) 


v(t + dt) 


v(t) + adt 
1 + v(t)adt/c 2 ’ 


Expanding this to first order in dt yields 27 


(10.59) 



(10.60) 


Separating variables and integrating gives, using 1/(1 — z 2 ) = 1/2(1 —z) +1/2(1 +z), 

[ (r+s + tt+) dv=2a Si dt (1M1) 

This yields In [(1 + v/c)/( 1 — v/c)\ = 2 at/c. Solving for v, we find 28 


v(t) = c 


/ e 2ot/c_ 1 \ 

y e 2at/c +1 ) 


ct&nh(at/c). 


(10.62) 


Note that for small a or small t (more precisely, for at/c < 1), we obtain v(t) & at, 
as we should (because tanhz ~ z for small z, which follows from the exponential 
form of tanh). And for at/c 1, we obtain v(t) m c, as we should. 

If a happens to be a function of time, a(t), then we can’t take the a outside the 
integral in eq. (10.61), so we instead end up with the general formula, 

v(t) = ctanh J a(t) dt^j . (10.63) 

We therefore see that the rapidity, (j), as defined in eq. (10.51), is given by 

6U) m-f a(t ) dt. (10.64) 

c Jo 

Note that whereas v has c as a limiting value, <fi can become arbitrarily large. 
Looking at eq. (10.63) we see that the <j> associated with a given v is simply 1/mc 

27 Equivalently, just take the derivative of (v + w)/(l + vw/c 2 ) with respect to w, and then set 
w = 0. 

28 Alternatively, you can just use J 1/(1 — z 2 ) = tanh -1 z. You can also use the result of Problem 
12 to find v(t). See the remark in the solution to that problem (after trying to solve it, of course!). 
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times the time integral of the force (felt by the astronaut) needed to bring the 
astronaut up to speed v. By applying a force for an arbitrarily long time, we can 
make cj) arbitrarily large. 

The integral / a(t) dt may be described as the naive, incorrect speed. That is, 
it is the speed that the astronaut might think he has, if he has his eyes closed and 
knows nothing about the theory of relativity. And indeed, his thinking would be 
essentially correct for small speeds. This quantity / a(t) dt seems like a reasonably 
physical thing, so if there is any justice in the world, / a(t) dt = f F(t) dt/m should 
have some meaning. And indeed, although it doesn’t equal v, all you have to do to 
get v is take a tanh and throw in some factors of c. 

The fact that rapidities add via simple addition when using the velocity-addition 
formula, as we saw in eq. (10.52), is evident from eq. (10.63). There is really nothing 
more going on here than the fact that 

[ a(t) dt = [ a(t) dt+ [ a(t) dt. (10.65) 

Jt 0 Jto Jt l 

To be explicit, let a force be applied from to to t\ that brings a mass up to speed 
Pi = tanh (pi = tanh( /^' a dt). and then let an additional force be applied from t\ 
to t r 2 that adds on an additional speed of P 2 = tanh <p 2 = tanh(/ t * 2 adt), relative 
to the speed at ti. Then the resulting speed may be looked at in two ways: (1) it 
is the result of relativistically adding the speeds Pi = tanh d>\ and p 2 = tanh d> 2 , 
and (2) it is the result of applying the force from to to t 2 (you get the same final 
speed, of course, whether or not you bother to record the speed along the way at ti), 
which is P = tanh(J t * 2 adt ) = tanh(«^i + ^> 2 ), where the second equality comes from 
the statement, eq. (10.65), that integrals simply add. Therefore, the relativistic 
addition of tanh <pi and tanh ^2 gives tanh(0i + fa), as we wanted to show. 

10.8 Relativity without c 

In Section 10.1, we introduced the two postulates of Special Relativity, namely the 
speed-of-light postulate and the relativity postulate. In Appendix I, we show that 
together these imply that the coordinate intervals in two frames must be related by 
the Lorentz transformations, eqs. (10.13). 

It is interesting to see what happens if we relax these postulates. It is hard 
to imagine a “reasonable” universe where the relativity postulate does not hold 
(geocentric theories aside), but it is easy to imagine a universe where the speed 
of light depends on the frame of reference. Light could behave like a baseball, for 
example. So let’s drop the speed-of-light postulate and see what we can say about 
the coordinate transformations between frames, using only the relativity postulate. 

In Appendix I, the form of the transformations, just prior to invoking the speed- 
of-light postulate, is given in eq. (14.83) as 

x = A v (x' + vt'), 

t = A,(t' + I(a-T)x'). 


( 10 . 66 ) 
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We’ll put a subscript on A in this section, to remind us of the v dependence. Can 
we say anything about A v without invoking the speed-of-light postulate? Indeed we 
can. Define V v by 

M~(i 

V/ - V 2 v A*) ’ 

Eqs. (10.66) then become 

x = 

t = 

All we’ve done so far is make a change of variables. But we now make the following 
claim. 

Claim 10.1 V/ is independent ofv. 

Proof: As stated in the last remark in Section 10.3.1, we know that two successive 
applications of the transformations in eq. ( 10 . 68 ) must again yield a transformation 
of the same form. 

Consider a transformation characterized by velocity v \, and another one charac¬ 
terized by velocity V 2 ■ For simplicity, define 


so that A v = 


y/l ~ V 2 /V(f ‘ 


(10.67) 


y/l -v*/V* 
y/l ~ V‘ 2 /V(? (' V.f 


(x' + vt 1 ), 

+A 


( 10 . 68 ) 


Vi = V Vl , 
1 

y/l -v\/V? ’ 


V 2 = V v 

72 = 




(10.69) 


To calculate the composite transformation, it is easiest to use matrix notation. 
Looking at eq. (10.68), we see that the composite transformation is given by the 
matrix 


( 72 72^2 \ / 71 7Wl \ _ / 1 + ^ V2 + V2 \ 

I 72 # 72 A 71 # 71 )- 7172 v 1 + W) ’ 


(10.70) 


The composite transformation must still be of the form of eq. (10.68). But this 
implies that the upper-left and lower-right entries of the composite matrix must be 
equal. Therefore, V ) 2 = V?. Since this holds for arbitrary v\ and v 2 , we see that V,(f 
must be a constant, that is, independent of v. ■ 

Denote the constant value of V(f by V 2 . Then the coordinate transformations in 
eq. ( 10 . 68 ) become 


x 

t 


y/l ~ v 2 /V* 

1 

y/l ~ v*/V‘> 


(; X' + Vt'), 

(*' + 


(10.71) 
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We have obtained this result using only the relativity postulate. These trans¬ 
formations have the same form as the Lorentz transformations, eqs. (10.13). The 
only extra information in eqs. (10.13) is that V is equal to the speed of light, c. 
It is remarkable that we were able to prove so much by using only the relativity 
postulate. 

We can say a few more things. There are four possibilities for the value of V 2 . 
However, two of these are not physical. 

• V 2 = oo: This gives the Galilean transformations, x = x' + vt! and t = t'. 

• 0 < V 2 < oo: This gives transformations of the Lorentz type. V is the 
limiting speed of an object. 

• V 2 = 0: This case is not physical, because any nonzero value of v will make 
the 7 factor imaginary (and infinite). Nothing could ever move. 

• V 2 < 0: It turns out that this case is also not physical. You might be 
concerned that the square of V is less than zero, but this is fine because V 
appears in the transformations (10.71) only through its square. There’s no 
need for V to actually be the speed of anything. The trouble is that the 
nature of eqs. (10.71) implies the possibility of time reversal. This opens the 
door for causality violation and all the other problems associated with time 
reversal. We therefore reject this case. 

To be a little more explicit, define b 2 = —V 2 , where b is a positive number. 
Then eqs. (10.71) may be written in the form, 

x = x 1 cos 9 + (bt 1 ) sin 9, 

bt = — x' sin 9 + (bt') cos 9, (10.72) 

where tan 9 = v/b. This transformation is simply a rotation in the plane, 
through an angle of —9. We have the usual trig functions here, instead of the 
hyperbolic trig functions in the Lorentz transformations in eq. (10.55). 

Eqs. (10.72) satisfy the requirement that the composition of two transforma¬ 
tions is again a transformation of the same form. Rotation by 9\, and then by 
92, yields a rotation by 9\ +02- However, if the resulting rotation is through 
an angle, 9, that is greater than 90°, then we have a problem. The tangent of 
such an angle is negative. Therefore, tan 9 = v/b implies that v is negative. 
The situation is shown in Fig. 10.29. Frame S" moves at velocity V 2 > 0 
with respect to frame S', which moves at velocity Vi > 0 with respect to 
frame S. From the figure, we see that someone standing at rest in frame S" 
(that is, someone whose worldline is the bt!'- axis) is going to have some serious 
issues in frame S. For one, the bt"-axis has a negative slope in frame S, which 
means that as t increases, x decreases. The person will therefore be moving 
at a negative velocity with respect to S. Adding two positive velocities and 
obtaining a negative one is clearly absurd. But even worse, someone standing 
at rest in S" will be moving in the positive direction along the bt"-axis, which 


bt 



Figure 10.29 
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means that he will be traveling backwards in time in S. That is, he will die 
before he is born. This is not good. 

Note that all of the finite 0 < V 2 < oo possibilities are essentially the same. 
Any difference in the numerical definition of V can be absorbed into the definitions 
of the unit sizes for x and t. Given that V is finite, it has to be something, so it 
doesn’t make sense to put much importance on its numerical value. 

There is therefore only one decision to be made when constructing the spacetime 
structure of an (empty) universe. You just have to say whether V is finite or infinite, 
that is, whether the universe is Lorentzian or Galilean. Equivalently, all you have 
to say is whether or not there is an upper limit for the speed of any object. If there 
is, then you can simply postulate the existence of something that moves with this 
limiting speed. In other words, to create your universe, you simply have to say, “Let 
there be light.” 
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10.9 Exercises 

Section 10.2: The fundamental effects 

1. Effectively speed c * 

A rocket flies between two planets that are one light-year apart. What should 
the rocket’s speed be so that the time elapsed on the captain’s watch is one 
year? 

2. A passing train 

A train of length 15 cs moves at speed 3c/5. 29 How much time does it take 
to pass a person standing on the ground (as measured by that person)? Solve 
this by working in the frame of the person, and then again by working in the 
frame of the train. 

3. Passing trains * 

Train A has length L. Train B moves past A with relative speed 4c/5, in 
the same direction. The length of B is such that A says that the fronts of 
the trains coincide at exactly the same time as the backs coincide. What is 
the time difference between the fronts coinciding and the backs coinciding, as 
measured by B1 

4. Seeing the light * 

A and B leave from a common point and travel in opposite directions with 
relative speed 4c/5. When B 's clock shows that a time T has elapsed, he (B) 
sends out a light signal. When A receives the signal, what time does his (A’s) 
clock show? Answer this question by doing the calculation entirely in (a) A’s 
frame, and then (b) B’s frame. 

5. Walking on a train * 

A train of proper length L and speed 3c/5 approaches a tunnel of length L. 
At the moment when the front of the train enters the tunnel, a person leaves 
the front of the train and walks (briskly) toward the back. He arrives at the 
back of the train right when it leaves the tunnel. 

(a) How much time does this take in the ground frame? 

(b) What is the person’s speed with respect to the ground? 

(c) How much time elapses on the person’s watch? 

6. Simultaneous waves * 

Alice flies past Bob at speed v. Right when she passes, they both set their 
watches to zero. When Alice’s watch shows a time T, she waves to Bob. 
Bob then waves to Alice simultaneously (as measured by him) with Alice’s 
wave. Alice then waves to Bob simultaneously (as measured by her) with 

29 1 cs is one “light-second.” It equals (1)(3 • 10 8 m/s)(l s) = 3 • 10 8 m. 
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Bob’s wave. Bob then waves to Alice simultaneously (as measured by him) 
with Alice’s second wave. And so on. What are the readings on Alice’s watch 
for all the times she waves? And likewise for Bob? 

7. Triplets * 

Triplet A stays on the earth. Triplet B travels at speed 4c/5 to a planet (a 
distance L away) and back. Triplet C travels out to the planet at speed 3c/4, 
and then returns at the necessary speed to arrive back exactly when B does. 
How much does each triplet age during this process? Who is youngest? 

8. People clapping ** 

Two people stand a distance L apart along an east-west road. They both clap 
their hands at precisely noon in the ground frame. You are driving eastward 
down this road at speed 4c/5. You notice that you encounter the western 
person at the same instant (as measured in your frame) that the eastern person 
claps. Where are you along the road at the instant (as measured in your frame) 
that the western person claps? 

9. Coinciding runner ** 

A train of length L moves at speed 4c/5 eastward, and a train of length 3 L 
moves at speed 3c/5 westward. How fast must someone run along the ground 
if he is to coincide with both the fronts-passing-each-other and backs-passing- 
each-other events? 

10. Photon on a train ** 

(a) A train of proper length L moves with speed v with respect to the ground. 
At the instant the back of the train passes a certain tree, someone at the 
back of the train shines a photon toward the front. The photon happens 
to hit the front of the train at the instant the front passes a certain 
house. As measured in the ground frame, how far apart are the tree and 
the house? Solve this by working in the ground frame. 

(b) Now look at the setup from the point of view of the train frame. Using 
your result for the tree-house distance from part (a), verify that the house 
meets the front of the train at the same instant the photon meets it. 

11. Another train in a tunnel *** 

Consider the scenario of Problem 6, with the only change being that the train 
now has length rt, where r is some numerical factor. 

What is the largest value of r, in terms of v, for which it is possible for the 
bomb to not explode? (Verify that you obtain the same answer working in the 
frame of the train and working in the frame of the tunnel.) 

12. Falling stick ** 

A horizontal falling stick bounces off the ground. What does this look like in 
the frame of someone running by at speed v? 
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13. Through the hole? ** 

A stick of proper length L moves at speed v in the direction of its length. It 
passes over a infinitesimally thin sheet that has a hole of diameter L cut out 
of it. As the stick passes over the hole, the sheet is raised, and the stick then 
ends up underneath the sheet. 

In the lab frame, the stick’s length is contracted to L/ 7 , so it appears to easily 
makes it through the hole. But in the stick frame, the hole is contracted to 
L/ 7 , so it seems like the stick does not make it through the hole. So the 
question is: Does the stick make it through the hole or not? 

Section 10.3: The Lorentz transformations 

14. Successive L.T.’s * 

Show that the combination of an L.T. (with speed v\) and an L.T. (with speed 
V 2 ) yields an L.T. with speed (ui + U 2 )/(l + ni^/c 2 ). 

15. Loss of simultaneity * 

A train moves at speed v with respect to the ground. Two events occur 
simultaneously, a distance L apart, in the frame of the train. What is the 
time separation in the frame of the ground? Solve this by using the Lorentz 
transformations, and then again by using only the results in Section 10.2. 

16. Pythagorean triples * 

Let (a, b, h ) be a Pythagorean triplet. (We’ll use h to denote the hypotenuse, 
instead of c, for obvious reasons.) Consider the relativistic addition or sub¬ 
traction of the two speeds, /3\ = a/h and (h = b/h. Show that the numerator 
and denominator of the result are members of another pythagorean triplet, 
and find the third member. What is the associated 7 factor? 

17. Running on a train * 

A train of length L moves at speed v\ with respect to the ground. A passenger 
runs from the back of the train to the front, at speed V 2 with respect to the 
train. How much time does this take, as viewed by someone on the ground? 
Solve this problem in two different ways: 

(a) Find the relative speed of the passenger and the train (as viewed by 
someone on the ground), and then find the time it takes for the passenger 
to erase the initial “head start” that the front of the train had. 

(b) Find the time it takes on the passenger’s clock, and then use time dilation 
to get the time elapsed on a ground clock. 

18. Running away * 

A and B both start at the origin and simultaneously head off in opposite 
directions at speed 3c/5. A moves to the right, and B moves to the left. 
Consider a mark on the ground at x = L. As viewed in the ground frame, A 
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and B are of course a distance 2 L apart when A passes this mark. As viewed 
by A, how far away is B when A coincides with the mark? 

19. Velocity addition ** 

Derive the velocity-addition formula by using the following setup: A train of 
length L moves at speed a with respect to the ground, and a ball is thrown at 
speed b with respect to the train, from the back to the front. Let the speed of 
the ball with respect to the ground be V. 

Calculate the time of the ball’s journey, as measured by an observer on the 
ground, in the following two different ways, and then set them equal to solve 
for V in terms of a and b. 

(a) First way: Find the relative speed of the ball and the train (as viewed 
by someone on the ground), and then find the time it takes for the ball 
to erase the initial “head start” that the front of the train had. 

(b) Second way: Find the time elapsed on the ball’s clock, and then use time 
dilation to get the time elapsed on a ground clock. 

20. Bullets on a train ** 

A train moves at speed v. Bullets are successively fired at speed u (relative to 
the train) from the back of a train to the front. A new bullet is fired at the 
instant (as measured in the train frame) the previous bullet hits the front. In 
the frame of the ground, what fraction of the way along the train is a given 
bullet, at the instant (as measured in the ground frame) the next bullet is 
fired? What is the maximum number of bullets an observer on the ground can 
see in flight at any given instant? 

21. Some 7 ’s 

Show that the relativistic addition (or subtraction) of the velocities u and v 
has a 7 factor given by 7 = 7 , 1 7^(1 ± uv). 

22. Time dilation and Lv/c 2 ** 

A person walks very slowly at speed u from the back of a train of length L to 
the front. The time-dilation effect in the train frame can be made arbitrarily 
small by picking u to be sufficiently small (because the effect is of second-order 
in u). Therefore, if the person’s watch agrees with a clock at the back of the 
train when he starts, then it will also agree with a clock at the front when he 
finishes. 

Now consider this situation from the ground frame, where the train moves at 
speed v. The rear clock reads Lv /c 2 more than the front, so in view of the 
preceding paragraph, the person’s watch must gain a time that is Lv/c 2 less 
than the time gained by the front clock during the process. By working in the 
ground frame, explain why this is the case. 
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23. Angled photon * 

A photon moves at an angle 9 with respect to the a^-axis in frame S'. Frame 
S' moves at speed v with respect to frame S (along the x' axis). Calculate the 
components of the photon’s velocity in S, and verify that the speed is c. 

Section 10.The invariant interval 

24. Head start * 

Derive the Lv/c 2 “head-start” result (given in eq. 10.2) by making use of the 
invariant spacetime interval. 

25. Passing trains *** 

Train A of length L moves eastward at speed v. and train B of length 2 L 
moves westward also at speed v. How much time does it take for the trains 
to pass each other (defined as the time between the front of B coinciding with 
the front of A, and the back of B coinciding with the back of A): 

(a) As viewed by A? 

(b) As viewed by B ? 

(c) As viewed by the ground? 

(d) Verify that the invariant interval is indeed the same in all three frames. 

26. Passing a train * 

Person C stands on the ground. Train B (with proper length L ) moves to the 
right at speed 3c/5. And person A runs to the right at speed 4c/5. A starts 
behind the train and eventually passes it. 

Let event E\ be “A passes the back of the train,” and let event E 2 be “A 
passes the front of the train.” Find At and Ax between events E\ and E 2 in 
the frames of A, B. and C. And show that (AAt 2 — Ax 2 is the same in all 
frames. 

27. Throwing on a train ** 

A train with proper length L moves at speed 3c/5 with respect to the ground. 
A ball is thrown from the back to the front, at speed c/2 with respect to the 
train. How much time does this take, and what distance does the ball cover, 
in: 

(a) The train frame? 

(b) The ground frame? Solve this by 

i. Using a velocity-addition argument. 

ii. Using the Lorentz transformations to go from the train frame to the 
ground frame. 

(c) The ball frame? 

(d) Verify that the invariant interval is indeed the same in all three frames. 
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(e) Show that the times in the ball frame and ground frame are related by 
the relevant 7 -factor. 

(f) Ditto for the ball frame and train frame. 

(g) Show that the times in the train frame and ground frame are not related 
by the relevant 7 -factor. Why not? 

Section 10.5: Minkowski diagrams 

28. Time dilation via Minkowski * 

In the spirit of the example in Section 10.5, use a Minkowski diagram to derive 
the time-dilation result between frames S and S' (in both directions, as in the 
example). 

29. Simultaneous claps *** 

With respect to the ground, A moves to the right at speed c/V 3, and B 
moves to the left, also at speed c/V 3. At the instant they are a distance d 
apart (as measured in the ground frame), A claps his hands. B then claps 
his hands simultaneously (as measured by B ) with A’s clap. A then claps his 
hands simultaneously (as measured by A) with B' s clap. B then claps his 
hands simultaneously (as measured by B ) with A’s second clap, and so on. As 
measured in the ground frame, how far apart are A and B when A makes his 
nth clap? What is the answer if c/V 3 is replaced by a general speed vl 

30. Train in tunnel ** 

Repeat Exercise 11, but now solve it by using a Minkowski diagram. (Do this 
from the point of view of the train, and also of the tunnel.) 

Section 10.6: The Doppler effect 

31. Transverse Doppler ** 

As mentioned in Remark 3 of Section 10.6.2, it is possible to solve the trans¬ 
verse Doppler effect for Case 1 by working in the frame of the observer, pro¬ 
vided that you account for the longitudinal component of the source’s motion. 
Solve the problem this way and reproduce eq. (10.49). 

32. Twin paradox via Doppler ** 

Twin A stays on earth, and twin B flies at speed v to a distant star and back. 
The star is a distance L from the earth. Use the Doppler effect to show that 
B is younger by a factor 7 when she returns. Do this in two ways: 

(a) A sends out flashes at 1-second intervals. Let Ta and Tb be the total 
times on A’s and B's clocks. In terms of Tb, how many red- and blue- 
shifted flashes does B receive? Equate the total number of flashes B 
receives with with the total number of flashed A emits, to show that 
T b = T A / 7 . 
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(b) B sends out flashes at 1-second intervals. In terms of L and v. how many 
red- and blue-shifted flashes does A receive? The total number of flashes 
A receives is simply the total time on B’s clock. Use this to show that 

t b = T A / r 
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Figure 10.30 


10.10 Problems 

Section 10.2: The fundamental effects 

1. No transverse length contraction * 

Two meter sticks, A and B. move past each other as shown in Fig. 10.30. 
Stick A has paint brushes at its ends. Use this setup to show that in the frame 
of one stick, the other stick still has a length of one meter. 

2. Explaining time dilation ** 

Two planets, A and B, are at rest with respect to each other, a distance L 
apart, with synchronized clocks. A spaceship flies at speed v past planet A 
and synchronizes its clock with A’s (they both set their clocks to zero). It 
then flies past planet B and compares its clock to B' s. We know that when 
the spaceship reaches B, B' s clock will simply read L/v. And the spaceship’s 
clock will read L/jv, since it runs slow by a factor of 7 , compared to the 
planets’ clocks. 

How would someone on the spaceship quantitatively explain to you why B' s 
clock reads L/v (which is more than its own L/'yv), considering that the 
spaceship sees B' s clock running slow! 

3. Explaining Length contraction ** 

Two bombs lie on a train platform, a distance L apart. As a train passes by 
at speed v, the bombs explode simultaneously (in the platform frame) and 
leave marks on the train. Due to the length contraction of the train, we know 
that the marks on the train will be a distance 7 L apart when viewed in the 
train’s frame (since this distance is what is length-contracted down to the 
given distance L in the platform frame). 

How would someone on the train quantitatively explain to you why the marks 
are 7 L apart, considering that the bombs are only a distance L /7 apart in the 
train frame? 

4. A passing stick ** 

A stick of length L moves past you at speed v. There is a time interval between 
the front end coinciding with you and the back end coinciding with you. What 
is this time interval in 

(a) your frame? (Calculate this by working in your frame.) 

(b) your frame? (Work in the stick’s frame.) 

(c) the stick’s frame? (Work in your frame. This is the tricky one.) 

(d) the stick’s frame? (Work in the stick’s frame.) 

5. Rotated square * 

A square with side L flies past you at speed v, in a direction parallel to two of 
its sides. You stand in the plane of the square. When you see the square at its 
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nearest point to you, show that it looks to you like it is simply rotated, instead 
of contracted. (Assume that L is small compared to the distance between you 
and the square.) 

6. Train in a tunnel ** 

A train and a tunnel both have proper lengths L. The train speeds toward 
the tunnel, with speed v. A bomb is located at the front of the train. The 
bomb is designed to explode when the front of the train passes the far end of 
the tunnel. A deactivation sensor is located at the back of the train. When 
the back of the train passes the near end of the tunnel, this sensor tells the 
bomb to disarm itself. Does the bomb explode? 

7. Seeing behind the stick ** 

A ruler is positioned perpendicular to a wall. A stick of length L flies by at 
speed v. It travels in front of the ruler, so that it obscures part of the ruler 
from your view. When the stick hits the wall it stops. 

In your reference frame, the stick is shorter than L. Therefore, right before it 
hits the wall, you will be able to see a mark on the ruler that is less than L 
units from the wall (see Fig. 10.31). 

But in the stick’s frame, the marks on the ruler are closer together. Therefore, 
when the wall hits the stick, the closest mark to the wall that you can see on 
the ruler is greater than L units (see Fig. 10.31). 

Which view is correct (and what is wrong with the incorrect one)? 

8. Cookie cutter ** 

Cookie dough (chocolate chip, of course) lies on a conveyor belt which moves 
along at speed v. A circular stamp stamps out cookies as the dough rushes by 
beneath it. When you buy these cookies in a store, what shape are they? That 
is, are they squashed in the direction of the belt, stretched in that direction, 
or circular? 


ruler stick 


(lab frame) v 


_ / 

(stick frame) 

Figure 10.31 


9. The twin paradox ** 

Consider the usual twin paradox: Person A stays on the earth, while person 
B flies quickly to a distant star and back. B is younger than A when they 
meet up again. The paradox is that one might argue that although A will see 
B’s clock moving slowly, B will also see A’s clock moving slowly, so A should 
be younger than B. 

There are many resolutions to this “paradox”. Perform the following one: Let 
B’s path to the distant star be lined with a wire that periodically zaps B as he 
flies along (see Fig. 10.32). Let this be accomplished by having every point in 
the wire emit a ‘zap’ simultaneously in A’s frame. Let t A be the time between 
zaps in A’s frame. Find the time between zaps in B’s frame, and then use the 
fact that both A and B agree on the total number of times B gets zapped. 



earth star 

Figure 10.32 
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Figure 10.33 


10. Velocity addition from scratch *** 

A ball moves at speed v\ with respect to a train. The train moves at speed V 2 
with respect to the ground. What is the speed of the ball with respect to the 
ground? Solve this problem (that is, derive the velocity addition formula, eq. 
(10.28)) in the following way. (Do not use time dilation, length contraction, 
etc. Use only the fact that the speed of light is the same in any inertial frame.) 

Let the ball be thrown from the back of the train. At this instant, a photon 
is released next to it (see Fig. 10.33). The photon heads to the front of the 
train, bounces off a mirror, heads back, and eventually runs into the ball. In 
both frames, find the fraction of the way along the train the meeting occurs, 
and then equate these fractions. 


Figure 10.34 



Figure 10.35 


4c/5 ? 3c/5 

A C B 

Figure 10.36 


C 


Section 10.3: The Lorentz transformations 

11. A bunch of L.T.’s * 

Verify that the values of Ax and At in the table in the example in Section 10.4 
satisfy the Lorentz transformations between the six pairs of frames, namely 
AB, AC , AD, BC, BD, and CD. (See Fig. 10.34.) 

12. Many velocity additions ** 

An object moves at speed v\/c = (i\ with respect to S\, which moves at speed 
/?2 with respect to S 2 , which moves at speed /Tj with respect to S 3 , and so on, 
until finally S^v-i moves at speed with respect to Sjv (see Fig. 10.35). 
Show that the speed, /3n v), of the object with respect to Sn can be written as 

= (10.73) 

-UV + 1 N 

where 

N N 

and p n = rK 1 -/*)■ ( io - 74 ) 

13. Equal speeds 

A and B travel at 4c/5 and 3c/5, respectively, as shown in Fig. 10.36. How 
fast should C travel between them, so that he sees A and B approaching him 
at the same speed? What is this speed? 

14. More equal speeds = 1 = 

A moves at speed v, and B is at rest, as shown in Fig. 10.37. How fast must 
C travel, so that she sees A and B approaching her at the same rate? 

In the lab frame (ITs frame), what is the ratio of the distances CB and AC? 
The answer to this is very nice and clean. Can you think of a simple intuitive 
explanation for the result? 


Figure 10.37 
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15. Transverse velocity addition ** 

For the special case u' x = 0, the transverse velocity addition formula, eq. 
(10.35), yields u y = u' y / 7 . Derive this in the following way. 

In frame S' , a particle moves with speed u! in the ^/-direction. Frame S moves 
to the left with speed v, so that the situation in S looks like that in Fig. 10.38, 
with the y-speed now u. Consider a series of equally spaced dotted lines, as 
shown. The ratio of times between passes of the dotted lines in frames S and 
S' is Ts/Tg’ = (1 /v)/(1 /v!) = u'/u. Derive another expression for this ratio 
by using time dilation arguments, and then equate the two expressions to solve 
for u in terms of u' and v. 

16. Equal transverse speeds * 

In the lab frame, an object moves with velocity ( u x ,u y ), and you move with 
speed v in the ^-direction. What must v be so that you also see the object 
moving with speed u y in your y-direction? (One solution is of course v = 0 . 
Find the other one.) 

17. Relative speed * 

In the lab frame, two particles move with speed v along the paths shown in 
Fig. 10.39. The angle between the trajectories is 20. What is the speed of one 
particle, as viewed by the other? (This problem is posed again in Chapter 12, 
where it can be solved in a much simpler way, using 4-vectors.) 

18. Another relative speed ** 

In the lab frame, particles A and B move with speeds u and v along the paths 
shown in Fig. 10.40. The angle between the trajectories is 0. What is the 
speed of one particle, as viewed by the other? (This problem is posed again in 
Chapter 12, where it can be solved in a much simpler way, using 4-vectors.) 

19. Modified twin paradox *** 

Consider the following variation of the twin paradox. A, B, and C each have a 
clock. In A’ s reference frame, B flies past A with speed v to the right. When 
B passes A, they both set their clocks to zero. Also, in A’s reference frame, C 
starts far to the right and moves to the left with speed v. When B and C pass 
each other, C sets his clock to read the same as B' s. Finally, when C passes 
A, they compare the readings on their clocks. At this event, let A’ s clock read 
T a , and let C”s clock read T c . 

(a) Working in A’ s frame, show that T c = T A /q, where 7 = l/^/l — v 2 /c 2 . 

(b) Working in B' s frame, show again that T c = T A /^f. 

(c) Working in C’s frame, show again that T c = T A /^f. 


_ «' 


S' 


T r s 



Figure 10.38 



Figure 10.39 
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Figure 10.40 
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Figure 10.41 


Section 10.4-' The invariant interval 

20. Throwing on a train ** 

A train with proper length L moves at speed c/2 with respect to the ground. 
A ball is thrown from the back to the front, at speed c/3 with respect to the 
train. How much time does this take, and what distance does the ball cover, 
in: 

(a) The train frame? 

(b) The ground frame? Solve this by 

i. Using a velocity-addition argument. 

ii. Using the Lorentz transformations to go from the train frame to the 
ground frame. 

(c) The ball frame? 

(d) Verify that the invariant interval is indeed the same in all three frames. 

(e) Show that the times in the ball frame and ground frame are related by 
the relevant 7 -factor. 

(f) Ditto for the ball frame and train frame. 

(g) Show that the times in the train frame and ground frame are not related 
by the relevant 7 -factor. Why not? 


Section 10.5: Minkowski diagrams 

21. A new frame 

In one reference frame, Event 1 happens at x = 0, ct = 0, and Event 2 happens 
at x = 2, ct = 1. Find a frame where the two events are simultaneous. 

22. Minkowski diagram units * 

Consider the Minkowski diagram in Fig. 10.41. In frame S, the hyperbola 
c 2 t 2 -x 2 = 1 is drawn. Also drawn are the axes of frame S', which moves past 
S with speed v. Use the invariance of the interval s 2 = c 2 t 2 — x 2 to derive 
the ratio of the unit sizes on the ct' and ct axes (and check the result with eq. 
(10.43)). 

23. Velocity addition via Minkowski * 

An object moves at speed v\ with respect to frame S'. Frame S' moves at 
speed V 2 with respect to frame S. (in the same direction as the motion of the 
object). What is the speed, u, of the object with respect to frame S'? 

Solve this problem (i.e., derive the velocity addition formula) by drawing a 
Minkowski diagram with frames S and S', drawing the worldline of the object, 
and doing a little geometry. 
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24. Acceleration and redshift *** 


Use a Minkowski diagram to do the following problem: 


Two people stand a distance d apart. They simultaneously start accelerating 
in the same direction (along the line between them), each with proper accel¬ 
eration a. At the instant they start to move, how fast does each person see 
the other person’s clock tick? 

25. Break or not break? *** 

Two spaceships float in space and are at rest relative to each other. They 
are connected by a string (see Fig. 10.42). The string is strong, but it 
cannot withstand an arbitrary amount of stretching. At a given instant, the 
spaceships simultaneously start accelerating (along the direction of the line 
between them) with the same acceleration. (Assume they bought identical 
engines from the same store, and they put them on the same setting.) 

Will the string eventually break? 


n>.□> 


Figure 10.42 


Section 10.7: Rapidity 

26. Successive Lorentz transformations 

The Lorentz transformation in eq. (10.55) may be written in matrix form as 

( x \ _ ( cosh</> sinh</> \ ( x' \ nn7 ,x 

y ct J y sinh (f> cosing J y ct' J ' 

Show that by applying an L.T. with vi = tanh (f)\. and then another L.T. with 
V 2 = tanh 02 , you do indeed obtain the L.T. with v = tanh(^>i + fa). 

27. Accelerator’s time * 

A spaceship is initially at rest in the lab frame. At a given instant, it starts to 
accelerate. Let this happen when the lab clock reads t = 0 and the spaceship 
clock reads t' = 0. The proper acceleration is a. (That is, at time t' + dt'. the 
spaceship is moving at a speed adt' relative to the frame it was in at time t'.) 
Later on, a person in the lab measures t and t'. What is the relation between 
them? 
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| B (A's view) 

B (B’s view) 

Figure 10.43 


10.11 Solutions 


1. No transverse length contraction 

Assume that the paint brushes are capable of leaving marks on stick B if B is long 
enough, or if A short enough. The key fact we need here is the second postulate of 
relativity, which says that the frames of the sticks are equivalent. That is, if A sees B 
shorter than (or longer than) (or equal to) itself, then B also sees A shorter than (or 
longer than) (or equal to) itself. The contraction factor must be the same in going 
each way between the frames. 

Assume that A sees B shortened. Then B won’t reach out to the ends of A, so there 
will be no marks on B. But in this case, B must also see A shortened, so there will 
be marks on B (see Fig. 10.43). This is a contradiction. 

Likewise, if we assume that A sees B lengthened, we also reach a contradiction. Hence, 
they each must see the other stick as one meter long. 

2. Explaining time dilation 

The resolution to the apparent paradox is the “head start” that B’s clock has over 
A’s clock (in the spaceship frame). From eq. (10.2), we know that in the spaceship 
frame, B’s clock reads Lv/c 2 more than A’s. (The two stars may be considered to be 
at the ends of the train in the example in Section 10.2.1.) 

Therefore, what a person on the spaceship says is this: “My clock advances by L/'yv 
during the whole trip. I see B’s clock running slow by a factor 7, so I see B’s clock 
advance by only (L/ 7u)/q = L/j 2 v. However, B’s clock started not at zero but at 
Lv/c 2 . Therefore, the final reading on B’s clock when I get there is 



as it should be.” 


3. Explaining Length contraction 

The resolution to the apparent paradox is that the explosions do not occur simul¬ 
taneously in the train frame. As the platform rushes past the train, the rear bomb 
explodes before the front bomb explodes. The front bomb then gets to travel far¬ 
ther by the time it explodes and leaves its mark. The distance between the marks is 
therefore larger than you might naively expect. Let’s be quantitative about this. 

Let the two bombs contain clocks that read a time t when they explode (they are 
synchronized in the ground frame). Then in the frame of the train, the front bomb’s 
clock reads only t —Lv/c 2 when the rear bomb explodes when showing a time t. (This 
is the “head start” result from eq. (10.2).) The front bomb’s clock must therefore 
advance by a time of Lv/c 2 before it explodes. But since the train sees the bombs’ 
clocks running slow by a factor 7, we conclude that in the frame of the train, the 
front bomb explodes a time 7 Lv/c 2 after the rear bomb explodes. During this time 
of 7 Lv/c 2 , the platform moves a distance (7 Lv/c 2 )v relative to the train. 

Therefore, what a person on the train says is this: “Due to length contraction, the 
distance between the bombs is L/7. The front bomb is therefore a distance L /7 ahead 
of the rear bomb when the latter explodes. The front bomb then travels an additional 
distance of L'yv 2 /c 2 by the time it explodes, at which point it is a distance of 


L 

7 


i-MKK)-* 


ahead of the rear bomb’s mark, as we wanted to show.” 


(10.77) 
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4. A passing stick 


(a) The stick has length L /7 in your frame. It moves with speed v. Therefore, the 
time taken in your frame to cover the distance of L/7 is L/yv. 

(b) The stick sees you fly by at speed v. The stick has length L in its own frame, 
so the time elapsed in the stick frame is simply L/v. During this time, the stick 
will see the watch on your wrist run slow, by a factor 7. Therefore, a time of 
L/'yv elapses on your watch, in agreement with part (a). 


Remark: Logically, the two solutions (a) and (b) differ in that one uses length 

contraction and the other uses time dilation. Mathematically, they differ simply in the 
order in which the divisions by 7 and v occur. X 
(c) You see the rear clock on the train showing a time of Lv/(? more than the front 
clock. In addition to this head start, more time will of course elapse on the rear 
clock by the time it reaches you. The time in your frame is L/'yv (since the 
train has length L /7 in your frame). But the train’s clocks run slow, so a time 
of only L/j 2 v will elapse on the rear clock by the time it reaches you. The total 
time that the rear clock shows is therefore 


Lv L L (v 2 1\ L (v 2 (, v 2 \\ L 

in agreement with the quick calculation below in part (d). 

(d) The stick sees you fly by at speed v. The stick has length L in its 
so the time elapsed in the stick frame is simply L/v. 


(10.78) 


own frame, 


5. Rotated square 

Fig. 10.44 shows a top view of the square at the instant (in your frame) when it is 
closest to you. Its length is contracted along the direction of motion, so it takes the 
shape of a rectangle with sides L and L/7. This is what the shape is in your frame 
(where Is-ness is defined by where all the points of an object are at simultaneous 
times). But what does the square look like to you? That is, what is the nature of the 
photons hitting your eye at a given instant? 30 

Photons from the far side of the square have to travel an extra distance L to get 
to your eye, compared to ones from the near side. So they need an extra time L/c 
of flight. During this time L/c, the square moves a distance Lv/c = L/3 sideways. 
Therefore, referring to Fig. 10.45, a photon emitted at point A reaches your eye at 
the same time as a photon emitted from point B. 

This means that the trailing side of length L spans a dis tance L j3 across your field 
of vision, while the near side spans a distance L/7 = Ls/t:— ft 2 across your field of 
vision. But this is exactly what a rotated square of side L looks like, as shown in 
Fig. 10.46 (where the angle of rotation satisfies sin# = [!). 

6. Train in a tunnel 

Yes, the bomb explodes. This is clear in the frame of the train (see Fig. 10.47). 
In this frame, the train has length L, and the tunnel speeds past it. The tunnel is 

30 In relativity problems, we virtually always subtract off the time it takes light to travel from the 
object to your eye (that is, we find out what really is). Along with the Doppler effect discussed in 
Section 10.6, this problem is one of the few exceptions where we actually want to determine what 
your eye registers. 


L/y 


L 


Figure 10.44 



^P lVtF 

Figure 10.45 



Figure 10.47 









(tunnel frame) 
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Figure 10.48 


length-contracted down to L/ 7. Therefore, the far end of the tunnel passes the front 
of the train before the near end passes the back, and so the bomb explodes. 

We may, however, look at things in the frame of the tunnel (see Fig. 10.48). Here the 
tunnel has length L, and the train is length-contracted down to L/j. Therefore, the 
deactivation device gets triggered before the front of the train passes the far end of 
the tunnel, so you might think that the bomb does not explode. We appear to have 
a paradox. 

The resolution to this paradox is that the deactivation device cannot instantaneously 
tell the bomb to deactivate itself. It takes a finite time for the signal to travel the 
length of the train from the sensor to the bomb. It turns out that this transmission 
time makes it impossible for the deactivation signal to get to the bomb before the 
bomb gets to the far end of the tunnel, no matter how fast the train is moving. Let’s 
prove this. 

The signal has the best chance of winning this “race” if it has speed c. So let us 
assume this is the case. Now, the signal gets to the bomb before the bomb gets to 
the far end of the tunnel if and only if a light pulse emitted from the near end of 
the tunnel (at the instant the back of the train goes by) reaches the far end of the 
tunnel before the front of the train does. (That sentence was a mouthful.) The former 
takes a time L/c. The latter takes a time L( 1 — 1/7 )/v (since the front of the train 
is already a distance L /7 through the tunnel). So if the bomb is to not explode, we 
must have 



L( 1 - l/7)/« 

1 - y/1 -/ 3 2 
1 -/? 


(10.79) 


This can never be true. Therefore, the signal always arrives too late, and the bomb 
always explodes. 

7. Seeing behind the stick 

The first reasoning is correct. You will be able to see a mark on the ruler that is less 
than L units from the wall. (You will actually be able to see a mark even closer to 
the wall than L/ 7, as we’ll show below). The main point of this problem (and many 
other ones) is that signals do not travel instantaneously. The back of the stick does 
not know that the front of the stick has come into contact with the wall until a finite 
time has passed. Let’s be quantitative about this and calculate (in both frames) the 
closest mark to the wall that you can see. 

Consider your reference frame. The stick has length L/ 7. Therefore, when the stick 
hits the wall, you can see mark a distance L /7 from the wall. You will, however, be 
able to see a mark even closer to the wall, because the back end of the stick will keep 
moving forward, since it doesn’t yet know that the front end has hit the wall. The 
stopping signal (shock wave, etc.) takes time to travel. 

Let’s assume that the stopping signal travels along the stick at speed c. (You can 
work with a general speed u. But the speed c is simpler, and it yields an upper bound 
on the closest mark you can see.) Where will the signal reach the back end? Starting 
from the time the stick hits the wall, the signal travels backwards from the wall at 
speed c, and the back end of the stick travels forwards at speed v (from a point L /7 
away from the wall). So the relative speed (as viewed by you) of the signal and the 
back end is c+ v. Therefore, the signal hits the back end after a time (L/7)/(c + v). 
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During this time, the signal has traveled a distance c(L/j)/(c+v) from the wall. The 
closest point to the wall that you can see is therefore the 


L 

7(1+ /?) 



(10.80) 


mark on the ruler. 

Now consider the stick’s reference frame. The wall is moving to the left towards it at 
speed v. After the wall hits the right end of the stick, the signal moves to the left with 
speed c, and the wall keeps moving to the left with speed v. Where is the wall when 
the signal reaches the left end? The wall travels v/c as, fast as the signal, so it travels 
a distance Lv/c in the time that the signal travels the distance L. This means that it 
is L(1 — v/c) away from the left end of the stick. In the stick’s frame, this corresponds 
to a distance / yL( 1 - v/c) on the ruler (because the ruler is length-contracted). So the 
left end of the stick is at the 


L 1 (1-P) = L^ 1 y -^ (10.81) 

mark on the ruler, in agreement with eq. (10.80). 

8. Cookie cutter 

Let the diameter of the cookie cutter be L, and consider the two following reasonings. 

• In the lab frame, the dough is length-contracted, so the diameter L corresponds 
to a distance larger than L (namely 7 L) in the dough’s frame. Therefore, when 
you buy a cookie, it is stretched out by a factor 7 in the direction of the belt. 31 

• In the frame of the dough, the cookie cutter is length-contracted in the direction 
of motion. It has length L/ 7. So in the frame of the dough, the cookies have a 
length of only L/ 7. Therefore, when you buy a cookie, it is squashed by a factor 
7 in the direction of the belt. 

Which reasoning is correct? The first one is. The cookies are stretched out. The 
fallacy in the second reasoning is that the various parts of the cookie cutter do not 
strike the dough simultaneously in the dough frame. What the dough sees is this: 
The cutter moves to, say, the left. The right side of the cutter stamps the dough, 
then nearby parts of the cutter stamp it, and so on, until finally the left side of the 
cutter stamps the dough. But by this time the front (that is, the left) of the cutter 
has moved farther to the left. So the cookie turns out to be longer than L. It takes 
a little work to demonstrate that the length is actually 7 L, but let’s do that now (by 
working in the dough frame). 

Consider the moment when the the rightmost point of the cutter strikes the dough. 
In the dough frame, a clock at the rear (the right side) of the cutter reads Lv/c 2 more 
than a clock at the front (the left side). The front clock must therefore advance by 
Lv/c 2 by the time it strikes the dough. (This is true because all points on the cutter 
strike the dough simultaneously in the cutter frame. Hence, all cutter clocks read the 
same when they strike.) But due to time dilation, this takes a time 7 (Lv/c 2 ) in the 
dough frame. During this time, the cutter travels a distance ^(7 Lv/c 2 ). Since the 

31 The shape is an ellipse, since that’s what a stretched-out circle is. The eccentricity of an ellipse 
is the focal distance divided by the semi-major axis length. As an exercise, you can show that this 
equals /3 = v/c here. 
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ct 



Figure 10.49 


ct 
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front of the cutter was initially a distance L /7 (due to length contraction) ahead of 
the back, the total length of the cookie in the dough frame equals 



= 7 L, 


as we wanted to show. 

9. The twin paradox 

The main point is that because the zaps occur simultaneously in A’s frame, they do 
not occur simultaneously in B’s frame. The zaps further ahead of B occur earlier (as 
you can show), so there is less time between zaps in B ’s frame than one might think 
(since B is constantly moving toward the zaps ahead, which happen earlier). 
Consider two successive zapping events. The Ax B between them is zero, so the Lorentz 
transformation A t A = j(At B — vAx B /c?) gives 

At B = A t A /j. (10.82) 

So if t A is the time between zaps in A’s frame, then t B = f A /7 is the time between 
zaps in B’s frame. (This is the usual time dilation result.) 

Let N be the total number of zaps B gets. Then the total time in A’s frame is 
T a = Nt A , while the total time in B’s frame is T B = Nt B = N(t A / 7). Therefore, 

T b = —. (10.83) 

7 


So B is younger. 

This can all be seen quite clearly if we draw a Minkowski diagram. Fig. 10.49 shows 
our situation where the zaps occur simultaneously in A’s frame. We know from eq. 
(10.43) that the unit size on B’s ct axis on the paper is' y(l -j- 0 2 )/(l — f) 2 ) times the 
unit size of A’s ct axis. Since the two pieces of B’s ct axis are y1 A<:0 2 times as long 
as the corresponding piece of A’s ct axis, we see that only \/l — /f 2 as many time 
units fit on B’s worldline as fit on A’s worldline. So B is younger. 

Remark: Eq. (10.83) is the result we wanted to show. But the obvious question is: Why 
doesn’t it work the other way around? That is, if we let A be zapped by zaps along a wire 
that occur simultaneously in B’s frame, why don’t we conclude that A is younger? The 
answer is that there is no one B frame; B has a different frame going out and coming in. 

As usual, the best way to see what is going on is to draw a Minkowski diagram. Fig. 10.50 
shows the situation where the zaps occur simultaneously in B’s frame. The lines of simul¬ 
taneity (as viewed by B) are tilted one way on the trip outward, and the other way on the 
trip back. The result is that A gets zapped frequently for a while, then no zaps occur for a 
while, then he gets zapped frequently again. The overall result, as we will now show, is that 
more time elapses in A’s frame than in B’s frame. 

Let the distant star be a distance d from the earth, in A’s frame. Let the zaps occur 
at intervals At B in B’s frame. Then they occur at intervals At n /y in A’a frame. (The 
same factor applies to both parts of the journey.) Let there be N total zaps during the 


Figure 10.50 
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journey. Then the total times registered by B and A are, respectively, T B = NAt B , and 
T a = N(At B /q) + 1, where t is the time where no zaps occur in A’s frame (in the middle of 
the journey). The distance between the earth and star in B’s frame is vT B /2. But we know 
that this distance also equals d/ 7. Therefore, NAt B =>T B = 2d/(-yv). So we have 

2d 

^+ t. (10.84) 

We must now calculate t. Since the slopes of B’s lines of simultaneity in the figure are ±/3, 
we see that ct = 2d tan 6 = 2d/3. Therefore, 


2d 

2d 2dfi _ 2d 
-fS|) c V ' 


(10.85) 


Hence, T B =T A / 7. 

(We know of course, without doing any calculations, that T A = 2d/v. But it is reassuring 
to add up the times when A is getting zapped and when he is not getting zapped, to show 
that we still get the same answer.) £ 


10. Velocity addition from scratch 


The important point here is that the meeting of the photon and the ball occurs at 
the same fraction of the way along the train, independent of the frame. This is true 
because although distances may change depending on the frame, fractions remain the 
same (because length contraction doesn’t depend on position). Let’s compute the 
desired fraction in the train frame S', and then in the ground frame S. 


Train frame: Let the train have length L'. We’ll first need to find the time at 
which the photon meets the ball (see Fig. 10.51). Light takes a time 17/c to reach 
the mirror. At this time, the ball has traveled a distance tq(£//c). The separation 
between the light and the ball at this time is therefore L'( 1 — tq/c). From this point 
on, the relative speed of the light and the ball is c + tq. Hence, it takes them an 
additional time L'( 1 — tq /c)/(e + iq) to meet. The total time before the meeting is 
therefore 


L' 1/(1 - ui/c) _ 2 L 

c c+v 1 c + v 1 


( 10 . 86 ) 



(frame S') 

Figure 10.51 


The distance the ball has traveled is thus v\t' = 2v\L'/(c + tq), and the desired 
fraction F' is 

F' = 2Vl . (10.87) 

c + v 1 

Ground frame: Let the speed of the ball with respect to the ground be v. Let the 
train have length L. Again, we’ll first need to find the time at which the photon meets 
the ball (see Fig. 10.52). Light takes a time L/{c— vT) to reach the mirror (since 
the mirror is receding at a speed vT). At this time, the ball has traveled a distance 
vL/(c— v 2 ), and the light has traveled a distance cL/(c — v 2 )■ The separation between 
the light and the ball at this time is therefore Lie — v)/{c — v 2 ). From this point on, 
the relative speed of the light and ball is c+v. Hence, it takes them an additional 
time L(c — v)/[(c—V2)(c + v)\ to meet. The total time before the meeting is therefore 


L L(c — v) _ 2 cL 

c-v 2 (c-v 2 )(c+v) (c-v 2 )(c+v)' 



(frame S) 
v 2 



(finish) - 


( 10 . 88 ) 


Figure 10.52 
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The distance the ball has traveled is thus 2vcL/[(c —i>2)(c+u)]. But the distance the 
back of the train has traveled in this time is 2w 2 cL/[(c — v 2 )(c + »)]. So the distance 
between the ball and the back of the train is 2{y — r' 2 jcL/[(c — v 2 )(c+w)]. The desired 
fraction F is therefore 


2 (v — V 2 )c 
(c - V 2 )(c + u) 


(10.89) 


We can now equate the above expressions for F' and F. For convenience, define 
P = v/c, Pi = vi/c, and 02 = V 2 /c. Then F' = F yields 


Pi _ P ~ P 2 
1 + Pl ~ (1~P2)(1 + P) ' 


(10.90) 


Solving for P in terms of p\ and /? 2 gives 32 


0- 


P 1 + P 2 

1 + 0102 ’ 


(10.91) 


as desired. 


11. Throwing on a train 


(a) In the train frame, the distance is simply d= L. And the time is t 
3 L/c. 


(b) 


i. The velocity of the ball with respect to the ground 
v = c/2) 


5c 

y ■ 


is (with 


L/(c/3) = 
= c/3 and 
(10.92) 


The length of the train with respect to the ground is T/71/2 = i/SL/2. 
Therefore, at time t, the position of the front of the train is \/3L/2 + vt. 
And the position of the ball is V g t. These two positions are equal when 


C V a -v)t = 


V3L 


\/3 L 



7 L 
\/3c 


(10.93) 


Equivalently, this time is obtained by noting that the ball closes the initial 
head-start of \/3L/2 that the front of the train had, at a relative speed of 
V g - v. 33 

The distance the ball travels is then d = V g t = (5c/7)(7L/\/3c) = 5L/V3. 
ii. In the train frame, the space and time intervals are x' = L and t' = 3L/c, 
from part (a). The 7-factor between the frames is 7^2 = 2/\/3, so the 
Lorentz transformation give the coordinates in the ground frame as 


x 

t 


7(0/ + vt') = 
7 (t 1 + vx'/c 2 ) 


m 


2 (3L §(L) 

^3 V c c 2 


5 L 


V3’ 



(10.94) 


in agreement with above. 

32 N. David Mermin does this problem in Am. J. Phys., 51 , 1130 (1983), and then takes things 
one step further in Am. J. Phys., 52 , 119 (1984). 

33 Yes, it’s legal to simply subtract these velocities, because they’re both measured with respect 
to the same frame, namely the ground. 
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(c) In the ball frame, the train has length, -b/71/3 = \/8L/3. Therefore, the time 
it takes the train to fly (at speed c/3) past the ball is t = (v / 8-h/3)/(c/3) = 
2\/2 L/c. And the distance is d = 0, of course, because the ball doesn’t move in 
the ball frame. 

(d) The values of c 2 t 2 — x 2 in the three frames are: 

Train frame: c 2 t 2 — x 2 = c 2 (3L/c) 2 — L 2 = 8 L 2 . 

Ground frame: c 2 f 2 — x 2 = (?(7L/\/3c) 2 — (5L/\/3) 2 = 8 L 2 . 

Ball frame: c 2 t 2 - x 2 = c 2 (2\/2 L/c) 2 - (0) 2 = 8 L 2 . 

These are all equal, as they should be. 

(e) The relative speed between the ball frame and the ground frame is 5c/7. There¬ 
fore, 75/7 = 7/2\/6, and the times are indeed related by 

*** ~ ^ = 27? (^)' wh>chistme - <10 - 95) 

(f) The relative speed between the ball frame and the train frame is c/3. Therefore, 
7 i/ 3 = 3/2\/2, and the times are indeed related by 

3 L 3 /2 V2L\ , . , . 

it = 7 tb •<=>■ — = I - 1 , which is true. (10.96) 

(g) The relative speed between the train frame and the ground frame is c/2. There¬ 
fore, 71/2 = 2/\/3, and the times are not related by a simple time-dilation factor. 

tg ^ ltt «=* (m97) 

We don’t obtain an equality because we can use time dilation only if the two 
events happen at the same place in one of the frames. In this problem, the “ball 
leaving the rear” and the “ball hitting the front” events happen at the same 
place in the ball frame, but not in the train frame or the ground frame. 

12. A bunch of L.T.’s 

The relative speeds, and the associated 7 factors, for the six pairs of frames are (using 
the results from the examples in Sections 10.2.3, 10.3.3, and 10.4) 



AB 

AC 

AD 

BC 

BD 

CD 

V 

5c/13 

4c/5 

c/5 

3c/5 

c/5 

5c/7 

7 

13/12 

5/3 

5/2\/6 

5/4 

5/2\/6 

7 /2\J 6 


The separations between the two events in the four frames are (from the example in 
Section 10.4)) 



A 

B 

C 

D 

Ax 

-L 

L 

5 L 

0 

At 

5 L/c 

5 L/c 

IL/c 

2V6 L/c 
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The Lorentz transformations are 

x = 7 (x' + vt'), 

t = 7(t' + vx’/(?). (10.98) 

For each of the six pairs, we’ll transform from the faster frame to the slower frame. 
This means that the coordinates of the faster frame will be on the right-hand side of 
the L.T.’s. The sign on the right-hand side of the L.T.’s will therefore always be a 
“+”. In the AB case, for example, we will write, “Frames B and A,” in that order, to 
signify that the B coordinates are on the left-hand side, and the A coordinates are on 
the right-hand side. We’ll simply list the L.T.’s for the six cases, and you can check 
that they do indeed all work out. 

Frames B and A : L 

5 L 


Frames C and A : 5 L 


Frames D and A : 0 

2\/6 L 






( 10 . 100 ) 
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it should. When N = 2, the formula gives 


_ P 2 + ~ P 2 ~ _ (1 + A)(l + 02) ~ (1 ~ /?l)(l ~ ft) _ ft + ft 
^ P} + P'2 (1 + ft)(l + ft) + (1 — /?l) (1 — ft) 1 + /?l/?2 


( 10 . 101 ) 


which agrees with the velocity addition formula. 

Let’s now prove the formula for general N. We will use induction. That is, we will 
assume that the result holds for N and then show that it holds for IV + 1. To find 
the speed, ft/v+i), of the object with respect to ftv+i, we can relativistically add the 
speed of the object with respect to Sn (which is 0(n)) with the speed of Sn with 
respect to ftv+i (which is /3 N+ 1). This gives 


P(N+1) - 


ftv+1 + ft JV) 
1 + ftv+iftjv) 


Under the assumption that our formula holds for N, this becomes 


ftiV+1) 


ftv+i ' 


1 + ftv+] 


(Pj v + Pn) + ftv+1 (Pfit — Pn) 

P n(1 + ftv+ l) - Pn( 1 - 0N+ 1) 
u^(l + Pn+i) + -Fjv(1 - 0N+ 1) 


-^jy+1 P N+ 1 

P N + 1 + -^JV+1 


( 10 . 102 ) 


(10.103) 


as we wanted to show. We have therefore shown that if the result holds for N, then 
it holds for IV + 1. Since know that the result holds for IV = 1, it thus holds for all 
N. 

The expression for p( N ) has some expected properties. It is symmetric in the /3 t . And 
if at least one of the /% equals 1, then Pn= 0- This yields 0{n) = 1, as it should. 
And if at least one of the ft equals —1, then P x = 0. This yields 0( N ) = —1, as it 
should. 


Remark: We can use the result of this problem to derive the v(t ) given in eq. (10.62). 
First, note that if all the ft in this problem are equal, and if their common value is much 
less than 1, then 

Cl + B) n - (1- B) n e 0N - e~ pN 

= tanh(ftV). (10.104) 

Let /3 equal a dt/c, which is the relative speed of two frames at nearby times in the spaceship 
scenario leading up to eq. (10.62). If we let N = t/dt be the number of frames here (and if 
we take the limit dt —> 0), then we have reproduced the spaceship scenario. Therefore, the 
ftv) in eq. (10.104) equals the v(t) in eq. (10.62). With the present values of N and 0, eq. 
(10.104) gives ftjv) =tanh(ot/c), as desired. & 

14. Equal speeds 

First Solution: Let C move at speed v. If C sees both A and B approaching him 
at the same speed, then the relativistic subtraction of v from 4c/5 must equal the 
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relativistic subtraction of 3c/5 from v. Therefore, (dropping the c’s), 



(10.105) 


This yields 0 = 35u 2 — 74c + 35 = (5c — 7) (7c — 5). Since the c = 7/5 root represents 
a speed larger than c, our answer is 


5 

7 C ' 


(10.106) 


Plugging this back into eq. (10.105) gives the relative speed of C and both A and B 
as u = c/h. 


Second Solution: Let the relative speed of C and both A and B be u. Then the 
relativistic subtraction of u from 4c/5 must equal the relativistic addition of u to 
3c/5 (because both of the results equal the speed of C with respect to the ground). 
Therefore, 



(10.107) 


This yields 0 = 5u 2 — 26 u + 5 = (5« — 1)(« — 5). Since the 
speed larger than c, our answer is 

c 

U= 5 ' 


= 5 root represents a 
(10.108) 


Plugging this back into eq. (10.107) gives the speed of C with respect to the ground 
as v = 5c/7. 


Third Solution: The relative speed of A and B is (dropping the c’s) 



(10.109) 


From C’s point of view, this 5/13 is the result of relativistically adding 
u. Therefore, 


5 _ 2u 
13 ~~ 1 + u 2 


5 u 2 — 2 6u + 5 = 0, 


with another 


( 10 . 110 ) 


as in the second solution. 


15. More equal speeds 


Let u be the speed at which C sees A and B approaching her. From C”s point of 
view, v is the result of relativistically adding u with another u. Therefore (dropping 
the c’s), 


v = 


2 u 

T+v? 


( 10 . 111 ) 


Note that the quadratic equation for u has another solution with a plus sign in front 
of the square root, but this solution cannot be correct, because it goes to infinity as v 
goes to zero. The above solution for u has the proper limit as v goes to zero, namely 
u —> c/2, which is obtained by using the Taylor expansion for the square root. 
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The ratio of the distances CB and AC in the lab frame is the same as the ratio of 
the differences in velocity, 

CB _V C - V B 
AC V A - V c 


We see that C is 7 times as far from B as she is from A. Note that for nonrelativistic 
speeds, we have 7 « 1, and C is of course midway between A and B. 

An intuitive reason for the simple factor of 7 is the following. Imagine that A and 
B are carrying identical jousting sticks as they run toward C. Consider what the 
situation looks like when the tips of the sticks reach C. In the lab frame (in which B 
is at rest), B’s stick is uncontracted, but A’s stick is length-contracted by a factor 7. 
Therefore, A is closer to C than B is, by a factor 7. 

16. Transverse velocity addition 

Assume that a clock on the particle shows a time T between successive passes of the 
dotted li nes. In frame S', the speed of the particle is u', so the time dilation factor is 
7' = l/\/l —n' 2 . The time between successive passes of the dotted lines is therefore 
T S > = 7 'T. _ 

In frame S, the speed of the particle is y/v 2 + u 2 . (Yes, the Pythagorean theorem 
holds for the speeds, because both speeds are measured with respect to the same 
frame.) Hence, the time dilation factor is 7 = l/v/l — v 2 — u 2 . The time between 
successive passes of the dotted lines is therefore Tg = 7T. 

Equating our two expressions for Ts/Ts’ gives 



Solving for u gives 



v! 

lv ’ 


(10.113) 


(10.114) 


17. Equal transverse speeds 

From your point of view, the lab frame is moving with speed v in the negative x- 
direction. The transverse velocity addition formula, eq. (10.35), therefore gives the 
y-speed in your frame as u y /'y( 1 — u x v). Demanding that this equals u y gives 

7(1 ~u x v) = 1 =» y/l^v 2 = (1 -u x v) u = , (10.115) 


or v = 0, of course. 

Remark: This answer makes sense. The fact that v is simply the relativistic addition of u x 
with itself means that both your frame and the original lab frame move at speed u x (but in 
opposite directions) relative to the frame in which the object has no speed in the .T-direction. 
By symmetry, therefore, the y-speed of the object must be the same in your frame and in 
the lab frame. & 
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B 



S '• 

5 •---*- 

Figure 10.53 


18. Relative speed 

Consider the frame, S', traveling along with the point P midway between the particles. 
S' moves at speed v cos 9, so the 7 factor relating it to the lab frame is 


^ \/l — v 2 cos 2 6 


(10.116) 


Let’s find the vertical speeds of the particles in S'. Since the particles have u' x = 0, 
the transverse velocity-addition formula, eq. (10.35), gives usin# = u^/7. Therefore, 
each particle moves along the vertical axis away from P with speed 


u' y = qusin#. (10.117) 

The speed of one particle as viewed by the other can now be found via the velocity- 
addition formula, 


If desired, this can be written as 


vT V^cos 3 #’ 2u sin 9 \/1 — v 2 cos 2 9 

' 1 + ~ 1 — n 2 cos 29 


(10.118) 


V = 


(1-v 2 ) 2 

(1 — i> 2 cos 29) 2 


(10.119) 


Remark: If 26 = 180°, then V = 2v/(l + v 2 ), as it should. And if 9 = 0°, then V = 0, 
as it should. If 6 is very small, then the result reduces to V « 2v sin (j/yd v 2 . You can 
convince yourself that this makes sense. & 

19. Another relative speed 

Let the velocity u point in the x-direction, as shown in Fig. 10.53. Let S' be the lab 
frame, and let S be A’s frame (so frame S' moves at speed —u with respect to S). The 
x- and y-speeds of B in frame S' are v cos 9 and v sin 9. Therefore the longitudinal and 
transverse velocity-addition formulas, eqs. (10.28) and (10.35), give the components 
of B’s speed in S as 


V x 

Vy 


n cos 9 —u 
1 —uv cos 9 ’ 
usind 

7„(1 — uv cos 9) 



1 — uv cos 9 


( 10 . 120 ) 


The total speed of B in frame S (that is, from the point of view of A) is therefore 


V 





\/u 2 + v 2 — 2 uv cos 9 — u 2 v 2 sin 2 9 


1 — uv cos 9 


( 10 . 121 ) 


If desired, this can be written as 


V = 


(1 -u 2 )(l-v 2 ) 
(1 — uncos#) 2 


( 10 . 122 ) 
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The reason why this can be written in such an organized form will become clear in 
Chapter 12. 


Remark: If u = v, this reduces to the result of the previous problem (if we replace 9 by 29). 
If 9 = 180°, then V = (u+v)/(l+uv), as it should. And if 9 = 0°, then V = \v-u\/{l-uv), 
as it should. & 

20. Modified twin paradox 

(a) In A’s reference frame, the worldlines of A, B, C are shown in Fig. 10.54. B' s 
clock runs slow by a factor 1/7. Therefore, if A’s clock reads t when B meets 
C, then B’s clock will read f/7 when he meets C. So the time he gives to C is 
t/l- 

ln A’s reference frame, the time between this event and the event where C meets 
A is again t (since B and C travel at the same speed). But A sees C’s clock run 
slow by a factor 1/7, so A sees C’s clock increase by f/7. 

Therefore, when A and C meet, A’s clock reads 2 1, and C”s clock reads 24/7. 
That is, T c = T A / 7. 

(b) Now let’s looks at things in B's frame. The worldlines of A, B, and C are shown 
in Fig. 10.55. At first glance, you might think that C’s clock should read more 
than A’s clock when they meet (which would contradict the result in part (a)), 
for the following reason. From B's point of view, he sees A’s clock run slow, 
so at the moment he transfers his time to C, he sees C’s clock read more than 
A’s. The resolution of this apparent paradox is that during the remainder of 
the journeys, B sees C’s clock run much slower than A’s (because the relative 
speed of C and B is greater than the relative speed of A and B). So in the end, 
C’s clock shows less time than A’s. Let’s be quantitative about this. 

Let .B’s clock read t B when he meets C. Then at this time, B sees A’s clock 
read t B / 7, and he hands off the time t B to C. 

We must now determine how much additional time elapses on A’s clock and C’s 
clock, by the time they meet. From the velocity-addition formula, B sees C 
flying by to the left at speed 2v/(l + v 2 ). He also sees A fly by to the left at 
speed v, but A had a head-start of vt B in front of C. Therefore, if t is the time 
between the meeting of B and C and the meeting of A and C (as viewed from 
B), then 


2vt 

l+i 


j = vt + vt B . 


This gives 


(10.123) 


(10.124) 


During this time, B sees A’s and C’s clocks increase by t divided by the relevant 
time-dilation factor. For A, this factor is 7 = 1 /y 1 — and for C it is 


(10.125) 


Therefore, the total time shown on A’s clock when A and C meet is 


T a 


ct 

M 



Figure 10.54 

ct 



Figure 10.55 
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ct 



Figure 10.56 



Figure 10.57 



(10.126) 


And the total time shown on C’s clock when A and C meet is 



(10.127) 


- v 2 s T A / r 


(c) Now let’s work in C’s frame. The worldlines of A, B, and C are shown Fig. 10.56 
As in part (b), the relative speed of B and C is 2v/{l+v 2 ), and the time-dilation 
factor between B and C is (1 + v 2 )/(l — v 2 ). Also, as is part (b), let B and C 
meet when B’s clock reads t B . So this is the time that B hands off to C. We’ll 
find all relevant times below in terms of t B . 


C sees B’s clock running slow, so from C’s point of view, B travels for a time 
t B (1 + v 2 )/(l — v 2 ). In this time, B covers a distance (from where he passed A) 


equal to 



2v 

1 + v 2 


2 vt B 

’ 


(10.128) 


in C’s frame. A must of course travel this same distance (from where he met 
B) to meet up with C. 

We can now find T A . The time (as viewed by C) that it takes A to travel the 
distance d to reach C is d/v = 2t B /(l - v 2 ). But since C sees A’s clock run 
slow, A’s clock will read only 


T a 


(10.129) 


Now let’s find T c . To find T c , we must take t B and add to it the extra time it 
takes A to reach C, compared to the time it takes B to reach C. From above, 
this extra time is 2t B /(l — v 2 ) — t B ( 1 + v 2 )/(l — v 2 ) = t B . Therefore, C’s clock 
reads 

T c = 2 t B . (10.130) 

Hence, T c = T a VT^A = T A /r 

21. A new frame 

First Solution: Consider the Minkowski diagram in Fig. 10.57. Event 1 is at the 
origin, and Event 2 is at the point (2,1), in the frame S. 

Consider a frame S' whose x' axis passes through the point (2,1). Since all points 
on the x' axis are simultaneous in the S' frame (they all have t' = 0), we see that S' 
is the desired frame. From Section 10.5, the slope of the x' axis is equal to ft = v/c. 
Since the slope is 1/2, we have v = c/2. 

(Looking at our Minkowski diagram, it is clear that if v > c/2, then Event 2 occurs 
before Event 1 in the new frame. And if v < c/2, then Event 2 occurs after Event 1 
in the new frame.) 
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Second Solution: Let the original frame be S, and let the desired frame be S'. Let 
S' move at speed v with respect to S. Our goal is to find v. 

The Lorentz transformations are 


Ax' = 


Ax — vAt 

y/l-v 2 /c 2 ' 


At — vAx/c 2 
y/l~v 2 /c 2 


(10.131) 


We want to make At' equal to zero, so the second of these equations yields At — 
vAx/c 2 = 0, or v = c 2 At/Ax. We are given Ax = 2, and At = 1/c, so the desired v 


v = c 2 At/Ax = c/2. 


(10.132) 


|~«—g-—» | ■ | 

x=0 x=2 


Figure 10.58 


Third Solution: Consider the setup in Fig. 10.58, which explicitly constructs two 
such given events. Receivers are located at x = 0 and x = 2. A light source is located 
at x = 1/2. This source emits a flash of light, and when the light hits a receiver we 
will say an event has occurred. So the left event happens at x = 0, ct = 1/2; and 
the right event happens at x = 2, ct = 3/2. (We may shift our clocks by — l/(2c) 
seconds in order to make the events happen at ct = 0 and ct=\. but this shift will 
be irrelevant since all we are concerned with is differences in time.) 

Now consider an observer flying by to the right at speed v. She sees the apparatus 
flying by to the left at speed v (see Fig. 10.59). Our goal is to find the v for which 
she sees the photons hit the receivers at the same time. 

Consider the photons moving to the left. She sees them moving at speed c, but the 
left-hand receiver is retreating at speed v. So the relative speed of the photons and 
the left-hand receiver is c — v. By similar reasoning, the relative speed of the photons 
and the right-hand receiver is c + v. 

The light source is three times as far from the right-hand receiver as it is from the 
left-hand receiver. Therefore, if the light is to reach the two receivers at the same 
time, we must have c + v = 3(c — v). This gives v = c/2. 

22. Minkowski diagram units 

All points on the cf'-axis have the property that x' = 0. All points on the hyperbola 
have the property that c 2 t' 2 — x' 2 = 1, due to the invariance of ,s 2 . So the ct' value 
at the intersection point, A, equals 1. Therefore, we simply have to determine the 
distance from A to the origin (see Fig. 10.60). 

We’ll do this by finding the (x, ct) coordinates of A. We know that tan 9 = ft = 
v/c. Therefore, x = (3{ct) (i.e., x = vt). Plugging this into the given information, 
c 2 t 2 — x 2 = 1, we find ct = 1/y/l — 0 2 . So the distance from A to the origin is 
VcH 2 + x 2 = ct\f£‘+ fi 2 — ^{1 +'/? 2 )/,(l' — fl 2 ). The ratio of the unit sizes on the ct' 
and ct axes is therefore 

(10.133) 

which agrees with eq. (10.43). 

(Exactly the same analysis holds for the x-axis unit size ratio, of course.) 

23. Velocity Addition via Minkowski 

Pick a point P on the object’s worldline. Let the coordinates of P in frame S be 
( x,ct ). Our goal is to find the speed u = x/t. Throughout this problem, it will be 
easier to work with the quantities /3 = v/c\ so our goal is to find 0 U = x/{ct). 




Figure 10.59 



Figure 10.60 
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ct ct' 



Figure 10.61 


The coordinates of P in S', namely (x\ ct'), are shown in Fig. 10.61. For convenience, 
let ct' have length a on the paper. Then from the given information, we have x' = 
v\t' = Pi(ct') = (3ia. In terms of a, we can easily determine the coordinates ( x,ct ) of 
P. The coordinates of point A (shown in the figure) are simply 

(x, cl )a = (asin0 2 ,acos02). (10.134) 

The coordinates of P, relative to A, are 

(x,ct) P _A = (/?iacos02,/?iasin0 2 ). (10.135) 

So the coordinates of point P are 

(. x,ct)p = (o sin 6*2 + Pi a cos 0 2 , a cos 0 2 + /3iasin0 2 ). (10.136) 


The ratio of x to ct at the point P is therefore 

„ _ x _ sin 62 + Pi cos 62 
u ct cos 6*2 + Pi sin 6*2 

tan 0 2 + Pi 
1 + Pi tan 0 2 
P2 + P1 
1 + P1P2 ’ 


(10.137) 


where we have used tan0 2 = /c = /? 2 , because S' moves at speed with respect to 
S. If we change from the P’s back to the v’s, the result is u = (n 2 + ui)/(I + Vi u 2 /c 2 ). 



Figure 10.62 



24. Acceleration and redshift 

There are various ways to do this problem (for example, by sending photons between 
the people, or by invoking the gravitational equivalence principle in GR, etc.). We’ll do 
it here by using a Minkowski diagram, to demonstrate that it can be solved perfectly 
fine using only basic Special Relativity. 

Draw the world lines of the two people, A and B, as seen by an observer, C, in 
the frame where they were both initially at rest. We have the situation shown in 
Fig. 10.62. 

Consider an infinitesimal time At, as measured by C. At this time (in C’s frame), 
A and B are both moving at speed aAt. The axes of the A frame are shown in 
Fig. 10.63. Both A and B have moved a distance o(At) 2 /2, which can be neglected 
since At is small. 34 Also, the special-relativity time-dilation factor between any of the 
A,B,C frames can be neglected. (Any relative speeds are no greater than v = aAt, 
so the time-dilation factors differ from 1 by at most order (At) 2 .) Let A make a little 
explosion, Ei, at this time. Then At (which was defined to be the time as measured 
by C) is also the time of the explosion, as measured by A (up to an error of order 
(At) 2 ). 

Let’s figure out where A’s x-axis (i.e., the ‘now’ axis in A’s frame) meets B’s worldline. 
The slope of A’s x-axis in the figure is v/c = aAt/c. So the axis starts at a height 
cAt, and then climbs up by the amount adAt/c, over the distance d. Therefore, the 
axis meets B’s worldline at a height cAt + adAt/c, as viewed by C: i.e, at the time 
At + adAt/c : 2 , as viewed by C. But C”s time is the same as B’s time (up to order 

34 It will turn out that the leading-order terms in the result below are of order At. Any (At) 2 
terms can therefore be ignored. 


Figure 10.63 
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(At) 2 ). So B’s clock reads At(l + ad/c 2 ). Let’s say that B makes a little explosion, 
E 2 , at this time. 

A sees both Ei and E 2 occur at the same time (they both lie along a line of constant 
time in A’s frame). In other words, A sees B’s clock read At(l + ad/c 2 ) when he sees 
his own clock read At. Therefore, A sees B’s clock sped up by a factor 


A t B 
A T a 


(10.138) 


We can perform the same procedure to see how B views A’s clocks. Drawing B’s 
x-axis at time At, we easily find that B sees A’s clock slowed down by a factor 


At A _ ad 
Ate = “ C 2 ' 


(10.139) 


Remark: In the usual special-relativity situation where two observers fly past each other 
with relative speed v, they both see the other person’s time slowed down by the same factor. 
This had better be the case, since the situation is symmetric between the observers. But in 
this problem, A sees B’s clock sped up, and B sees As clock slowed down. This difference is 
possible because the situation is not symmetric between A and B. The acceleration vector 
determines a direction in space, and one person (namely B) is further along this direction 
than the other person ( A). X 

25. Break or not break? 

There are two possible reasonings. 

(1) To an observer in the original rest frame, the spaceships stay the same distance, 
d, apart. Therefore, in the frame of the spaceships, the distance between them, d', 
must be greater than d. This is the case because d equals d’/ 7, by the usual length 
contraction. After a long enough time, 7 will differ appreciably from 1, and the string 
will be stretched by a large factor. Therefore, it will break. 

(2) Let A be the back spaceship, and let B be the front spaceship. From the point of 
view of A (B’s point of view would work just as well), it looks like B is doing exactly 
what A is doing. It looks like B undergoes the same acceleration as A, so B should 
stay the same distance ahead of A. Therefore, the string should not break. 

The second reasoning is incorrect. The first reasoning is (mostly) correct. The trouble 
with the second reasoning is that the two spaceships are in different frames. A in fact 
sees B’s clock sped up, and B sees A’s clock slowed down (from Problem 24). A 
sees B’s engine working faster, and B therefore pulls away from A. So the string 
eventually breaks. 

The first reasoning is mostly correct. The only trouble with it is that there is no one 
“frame of the spaceships”. Their frames differ. It is not clear exactly what is meant 
by the “length of the string”, because it is not clear what frame the measurement 
should take place in. 

Everything becomes more clear once we draw a Minkowski diagram. Fig. 10.64 shows 
the x ' and ct' axes of A’s frame. The x'-axis is tilted up, so it meets B’s worldline 
further to the right than one might think. The distance PQ along the x'-axis is 
the distance that A measures the string to be. Although it is not obvious that this 
distance in A’s frame is larger than d (because the unit size on the x' axis is larger 
than that in C”s frame), we can easily demonstrate this. In A’s frame, the distance 
PQ is greater than the distance PQ'. But PQ' is simply the length of something 



d 

Figure 10.64 
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in A’s frame which has length d in C s frame. So PQ' is 7 d in A’s frame. Since 
PQ > jd > d in A’s frame, the string breaks. 

Remarks: 


(a) If you want there to eventually be a well-defined “frame of the spaceships”, you can 
modify the problem by stating that after a while, the spaceships both stop accelerating 
simultaneously (as measured by C). Equivalently, both A and B turn off their engines 
after equal proper times. 

What A sees is the following. B pulls away from A. Then B turns off his engine. The 
gap continues to widen. But A continues firing his engine until be reaches B’s speed. 
Then they sail onward, in a common frame, keeping a constant separation (which is 
greater than the original separation.) 

(b) The main issue in this problem is that it depends exactly how you choose to accelerate 
an extended object. If you accelerate a stick by pushing on the back end (or by pulling 
on the front end), the length will remain essentially the same in its own frame, and it 
will become shorter in the original frame. But if you arrange for each end (or perhaps 
a number of points on the stick) to speed up in such a way that they always move at 
the same speed with respect to the original frame, then the stick will get torn apart. 
* 


26. Successive Lorentz transformations 

It is not necessary, of course, to use matrices in this problem; but things look nicer if 
you do. The desired composite L.T. is obtained by multiplying the matrices for the 
individual L.T.’s. So we have 


L 


( cosh fa sinh(/>2 \ ( cosh fa sinh</>i \ 

\ sinh^ 2 cosh / \ cosh^! J 

( cosh (j) 1 cosh (j) 2 + sinh (j) 1 sinh (j) 2 sinh (f> 1 cosh (j) 2 + cosh (j) 1 sinh (f> 2 

cosh fa sinh fa + sinh fa cosh fa sinh fa sinh fa + cosh fa cosh fa 


/ cosh(</>i + fa) sinh(</>i + fa) \ 
\ sinh(</>i + fa) cosh(</>i + fa) J 


(10.140) 


This is the L.T. with v = tanh(</>i + fa), as desired. This proof is just like the one for 
successive rotations in the plane (except for a few minus signs). 

27. Accelerator’s time 

Eq. (10.62) gives the speed as a function of the spaceship’s time, 

/3(t’) = ^ = tanh(at , /c). (10.141) 

The person in the lab sees the spaceship’s clock slowed down by a factor 1/7 = 
Jpj i.e., dt = dt'/yf \ — (i 2 - So we have 


-fat = f , £ 

Jo Jo Vi - W? 

= / cosh (at'/c)dt' 

Jo 

= - sinh(at'/c). 


(10.142) 
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Note that for small a or t' (more precisely, if at'/c -C 1), we obtain t ~ t', as we 
should. For very large times, we essentially have 

or t'—-ln(2at/c). (10.143) 

The lab frame will see the astronaut read all of “Moby Dick”, but it will take an 
exponentially long time. 
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Chapter 11 

Relativity (Dynamics) 

Copyright 2004 by David Morin, morin@physics.harvard.edu 

In the previous chapter, we dealt only with abstract particles flying through space 
and time. We didn’t concern ourselves with the nature of the particles, how they got 
to be moving the way they were moving, or what would happen if various particles 
interacted. In this chapter we will deal with these issues. That is, we will discuss 
masses, forces, energy, momentum, etc. 

The two main results of this chapter are that the momentum and energy of a 
particle are given by 


p = 7mv, and E = ~/mc 2 , (11.1) 

where 7 = l/^/l — v 2 /c 2 , and m is the mass of the particle. 1 When v 7C c, the 
expression for p reduces to p = mv, as it should for a non-relativistic particle. 
When v = 0, the expression for E reduces to the well-known E = me 2 . 


11.1 Energy and momentum 

In this section, we’ll give some justification for eqs. (11.1). The reasoning here 
should convince you of their truth. An alternative, and perhaps more convincing, 
motivation comes from the 4-vector formalism in Chapter 12. In the end, however, 
the justification for eqs. (11.1) is obtained through experiments. Every day, ex¬ 
periments in high-energy accelerators are verifying the truth of these expressions. 
(More precisely, they are verifying that these energy and momenta are conserved in 
any type of collision.) We therefore conclude, with reasonable certainty, that eqs. 
(11.1) are the correct expressions for energy and momentum. 

But actual experiments aside, let’s consider a few thought-experiments that mo¬ 
tivate the above expressions. 


1 People use the word “mass” in different ways in relativity. They talk about “rest mass” and 
“relativistic mass”. These terms, however, are misleading. There is only one thing that can reason¬ 
ably be called “mass” in relativity. It is the same thing that we call “mass” in Newtonian physics, 
and it is what some people would call “rest mass”, although the qualifier “rest” is redundant. See 
Section 11.8 for a discussion of this issue. 
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Figure 11.1 



Figure 11.2 



Figure 11.3 


11.1.1 Momentum 


Consider the following system. In the lab frame, identical particles A and B move 
as shown in Fig. 11.1. They move with equal and opposite small speeds in the 
x-direction, and with equal and opposite large speeds in the y-direction. Their 
paths are arranged so that they glance off each other and reverse their motion in 
the x-direction. 

For clarity, imagine a series of equally spaced vertical lines for reference. Assume 
that both A and B have identical clocks that tick every time they cross one of the 
lines. 

Consider now the reference frame that moves in the y-direction, with the same 
v y as A. In this frame, the situation looks like Fig. 11.2. The collision simply 
changes the sign of the x-velocities of the particles. Therefore, the x-momenta of 
the two particles must be the same. 2 

However, the x-speeds of the two particles are not the same in this frame. A 
is essentially at rest in this frame, and B is moving with a very large speed, v. 
Therefore, B' s clock is running slower than A’s, by a factor essentially equal to 
l/q = y 1 — ip/c 2 . And since B' s clock ticks once for every vertical line it crosses 
(this fact is independent of the frame), B must therefore be moving slower in the 
x-direction, by a factor of I/7. 

Therefore, the Newtonian expression, p x = mv x , cannot be the correct one for 
momentum, because B 's momentum would be smaller than A’s (by a factor of 1 /7), 
due to their different v x s. But the 7 factor in 

p x = -imv x = mVx (11.2) 

VI - v 2 / c 2 


precisely takes care of this problem, because 7 « 1 for A, and 7 = 1 j\J\ — v 2 /c 2 for 
B , which precisely cancels the effect of B 's smaller v x . 

To obtain the three-dimensional form for p, we now note that the vector p must 
point in the same direction as the vector v points. 3 Therefore, eq. (11.2) implies 
that the momentum vector must be 


p = 7mv = 


mv 

Vr^w’ 


(11.3) 


in agreement with eq. (11.1). Note that that all the components of p have the same 
denominator, which involves the whole speed, v 2 = v 2 + v 2 + v 2 . The denominator 
of, say, p x , is not y/1 — v 2 fc 2 . 


Remark: The above setup is only one specific type of collision, among an infinite 
number of possible types of collisions. What we’ve shown with this setup is that the only 

2 This is true because if, say, X’s p x were larger than S’s p x , then the total p x would point to 
the right before the collision, and to the left after the collision. Since momentum is something we 
want to be conserved, this cannot be the case. 

3 This is true because any other direction for p would violate rotation invariance. If someone 
claims that p points in the direction shown in Fig. 11.3, then he would be hard-pressed to explain 
why it doesn’t instead point along the direction p' shown. In short, the direction of v is the only 
preferred direction in space. 








11.1. ENERGY AND MOMENTUM 


XI-3 


possible vector of the form /(w)tov (where / is some function) that has any chance at being 
conserved in all collisions is 7mv (or some constant multiple of this). We haven’t proved 
that it actually is conserved in all collisions. This is where the gathering of data from 
experiments comes in. But we’ve shown above that it would be a waste of time to consider, 
for example, the vector 7 5 mv. £ 


11.1.2 Energy 

Having given some justification for the momentum expression, p = ymv, let us now 
try to justify the energy expression, 

E = 7 me 2 . (11.4) 


More precisely, we will show that 7 me 2 is conserved in interactions (or at least in 
the specific interaction below). There are various ways to do this. The best way, 
perhaps, is to use the 4-vector formalism in Chapter 12. But we’ll study one simple 
setup here that should do the job. 

Consider the following system. Two identical particles of mass m head toward 
each other, both with speed u, as shown in Fig. 11.4. They stick together and form 
a particle of mass M. M is at rest, due to the symmetry of the situation. At the 
moment we cannot assume anything about the size of M. We will find below that 
it does not equal the naive value of 2m. 

This is a fairly uninteresting setup (conservation of momentum gives 0 = 0), but 
now consider the less trivial view from a frame moving to the left at speed u. This 
situation is shown in Fig. 11.5. The right mass is at rest, M moves to the right at 
speed u, and the left mass moves to the right at speed v = 2u/(l + u 2 ), from the 
velocity addition formula. 4 Note that the 7-factor associated with this speed v is 


1 1 1 + u 2 



Conservation of momentum in this collision then gives 


7 V mv + 0 



7 uMu 

Mu 

7T^ 


(11.5) 


( 11 . 6 ) 


Conservation of momentum therefore tells us that M does not equal 2m. But if 
u is very small, then M is approximately equal to 2m, as we know from everyday 
experience. 

Using the value of M from eq. (11.6), let’s now check that our candidate for 
energy, E = ymc 2 , is conserved in this collision. There is no freedom left in any of 

4 We’re going to set c = 1 for a little while here, because this calculation would get a bit messy 
if we kept in the c’s. We’ll discuss the issue of setting c = 1 in more detail later in this section. 
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the parameters, so 7 me 2 is either conserved or it isn’t. In the original frame where 
M is at rest, E is conserved if 

7o Me 2 = 2(7 u mc 2 ) u 2 = 2 ( \/l -u 2 ) ™ ’ ( 1L7 ) 

which is indeed true. 

Let’s also check that E is conserved in the frame where the right mass is at rest. 
E is conserved if 


7 v mc 2 + 70 me 2 = y u Mc 2 , or 



which is indeed true. So E is also conserved in this frame. 

Hopefully at this point you’re convinced that 7 me 2 is a believable expression for 
the energy of a particle. But just as in the case of momentum, we haven’t proved 
that 7 me 2 actually is conserved in all collisions. This is the duty of experiments. 
But we’ve shown that it would be a waste of time to consider, for example, the 
quantity 7 4 mc 2 . 

One thing that we certainly need to check is that if E and p are conserved in 
one reference frame, then they are conserved in any other. We’ll demonstrate this 
in Section 11.2. A conservation law shouldn’t depend on what frame you’re in, after 
all. 

Remarks: 

1. To be precise, we should say that technically we’re not trying to justify eqs. (11.1) 
here. These two equations by themselves are devoid of any meaning. All they do is 
define the letters p and E. Our goal is to make a meaningful physical statement, not 
just a definition. 

The meaningful physical statement we want to make is that the quantities qmv and 
7me 2 are conserved in an interaction among particles (and this is what we tried to 
justify above). This fact then makes these quantities worthy of special attention, 
because conserved quantities are very helpful in understanding what is happening 
in a given physical situation. And anything worthy of special attention certainly 
deserves a label, so we may then attach the names “momentum” and “energy” to 
7mv and 7 me 2 . Any other names would work just as well, of course, but we choose 
these because in the limit of small speeds, 7mv and 7 me 2 reduce (as we will soon 
show) to some other nicely conserved quantities, which someone already tagged with 
the labels “momentum” and “energy” long ago. 

2. As mentioned above, the fact of the matter is that we can’t prove that qmv and yrnc 2 
are conserved. In Newtonian physics, conservation of p = mv is basically postulated 
by Newton’s third law, and we’re not going to be able to do any better than that 
here. All we can hope to do as physicists is provide some motivation for considering 
7mv and 7 me 2 , show that it is consistent for 7mv and 7 me 2 to be conserved during 
an interaction, and gather a large amount of experimental evidence, all of which is 
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consistent with 7mv and 7me 2 being conserved. That’s how physics works. You can’t 
prove anything. So you learn to settle for the things you can’t disprove. 

Consider, when seeking gestalts, 

The theories that physics exalts. 

It’s not that they’re known 
To be written in stone. 

It’s just that we can’t say they’re false. 


As far as the experimental evidence goes, suffice it to say that high-energy accelerators, 
cosmological observations, and many other forums are continually verifying everything 
that we think is true about relativistic dynamics. If the theory is not correct, then we 
know that it must be the limiting theory of a more complete one (just as Newtonian 
physics is a limiting theory of relativity). But all this experimental induction has to 
count for something... 

“To three, five, and seven, assign 
A name,” the prof said, “We’ll define.” 

But he botched the instruction 

With woeful induction 

And told us the next prime was nine. 


3 . Conservation of energy in relativistic mechanics is actually a much simpler concept 
than it is in nonrelativistic mechanics, because E = 7 nn is conserved, period. We 
don’t have to worry about the generation of heat, which ruins conservation of the 
nonrelativistic E = mv 2 / 2 . The heat is simply built into the energy. In the example 
above, the two m’s collide and generate heat in the resulting mass M. This heat 
shows up as an increase in mass, which makes M larger than 2 m. The energy that 
corresponds to the increase in mass is due to the initial kinetic energy of the two m’s. 

4 . Problem 1 gives an alternate derivation of the energy and momentum expressions 
in eq. ( 11 . 1 ). This derivation uses additional facts, namely that the energy and 
momentum of a photon are given by E = hv and p = hv/c , where v is the frequency 
of the light wave, and h is Planck’s constant, X 

Any multiple of 7771c 2 is also conserved, of course. Why did we pick 7777c 2 to 
label as “E” instead of, say, 57777c 3 ? Consider the approximate form 7777c 2 takes in 
the Newtonian limit, that is, in the limit v <C c. We have, using the Taylor series 
expansion for (1 — x) -1 / 2 , 


E = 7777c 2 


me 


3 v 4 


me + -mv + ■ 


( 11 . 9 ) 


The dots represent higher-order terms in v 2 /c 2 , which may be neglected if v <C c. In 
an elastic collision in Newtonian physics, no heat is generated, so mass is conserved. 
That is, the quantity me 2 has a fixed value. We therefore see that conservation of 
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E = jmc 2 reduces to the familiar conservation of Newtonian kinetic energy, mv 2 / 2 , 
for elastic collisions in the limit of slow speeds. 

Likewise, we picked p = 7mv, instead of, say, 67mc 4 v, because the former 
reduces to the familiar Newtonian momentum, mv, in the limit of slow speeds. 


Whether abstract, profound, or just mystic, 
Or boring, or somewhat simplistic, 

A theory must lead 
To results that we need 
In limits, nonrelativistic. 


Whenever we use the term “energy”, we will mean the total energy, 7me 2 . If 
we use the term “kinetic energy”, we will mean a particle’s excess energy over its 
energy when it is motionless, that is, ymc 2 — me 2 . Note that kinetic energy is not 
necessarily conserved in a collision, because mass is not necessarily conserved, as 
we saw in eq. ( 11 . 6 ) in the above scenario, where M = 2 m/\/\ — u 2 . In the CM 
frame, there was kinetic energy before the collision, but none after. Kinetic energy 
is a rather artificial concept in relativity. You virtually always want to use the total 
energy, 7771c 2 , when solving a problem. 

Note the following extremely important relation, 


E 2 - |p| V 


7 2 m 2 c 4 — 7 2 m 2 |v| 2 c 2 


= 7 2 m 2 c 4 1 


-S) 


( 11 . 10 ) 


This is a primary ingredient in solving relativistic collision problems, as we will soon 
see. It replaces the KE = p 2 / 2 m relation between kinetic energy and momentum 
in Newtonian physics. It can be derived in more profound ways, as we will see in 
Chapter 12 . Let’s put it in a box, since it’s so important, 

\e 2 =p 2 c 2 + m 2 c A \. (11.11) 


In the case where m = 0 (as with photons), eq. ( 11 . 11 ) says that E = pc. This 
is the key equation for massless objects. For photons, the two equations, p = 7mv 
and E = jmc 2 , don’t tell us much, because m = 0 and 7 = 00, so their product is 
undetermined. But E 2 — |p| 2 c 2 = m 2 c 4 still holds, and we conclude that E = pc. 

Note that any massless particle must have 7 = 00. That is, it must travel at 
speed c. If this weren’t the case, then E = 7771c 2 would equal zero, in which case 
the particle isn’t much of a particle. We’d have a hard time observing something 
with no energy. 

Another nice relation, which holds for particles of any mass, is 


P 

E 


( 11 . 12 ) 


Given p and E, this is definitely the quickest way to get v. 
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Setting c = 1 

For the remainder of our treatment of relativity, we will invariably work in units 
where c = 1. For example, instead of one meter being the unit of distance, we can 
make 3 • 10 8 meters equal to one unit. Or, we can keep the meter as is, and make 
1/(3 • 10 8 ) seconds the unit of time. In such units, our various expressions become 

p = 7 mv, E = 7 m, E 2 =p 2 + m 2 , ^ = v. (11.13) 

E 

Said in another way, you can simply ignore all the c’s in your calculations (which 
will generally save you a lot of strife), and then put them back into your final answer 
to make the units correct. For example, let’s say the goal of a certain problem is to 
find the time of some event. If your answer comes out to be £, where i is a given 
length, then you know that the correct answer (in terms of the usual mks units) 
has to be i/c, because this has units of time. In order for this procedure to work, 
there must be only one way to put the c’s back in at the end. This is always the 
case, because if there were two ways, then we would have c° = c b , for some numbers 
a^b. But this is impossible, because c has units. 

The general size of me 2 

What is the general size of me 2 ? If we let m = 1 kg, then we have me 2 = (1 kg) (3 • 
10 8 m/s ) 2 ~ 10 17 J. How big is this? A typical household electric bill might around 
$50 per month, or $600 per year. At about 10 cents per kilowatt-hour, this translates 
to 6000 kilowatt-hours per year. Since there are 3600 seconds in an hour, this 
converts to (6000) (10 3 ) (3600) « 2 • 10 10 watt-seconds. That is, 2 • 10 10 Joules per 
year. We therefore see that if one kilogram were converted completely into usable 
energy, it would be enough to provide electricity to 10 17 /(2 • 10 10 ), or 5 million, 
homes for a year. That’s a lot. 

In a nuclear reactor, only a small fraction of the mass energy is converted into 
usable energy. Most of the mass remains in the final products, which doesn’t help 
in lighting up your home. If a particle were to combine with its antiparticle, then 
it would be possible for all of the mass energy to be converted into usable energy. 
But we’re still a few years away from this. 

However, even a small fraction of the very large quantity, E = me 2 , can still be 
large, as evidenced by the use of nuclear power and nuclear weapons. Any quantity 
with a few factors of c is bound to change the face of the world. 


11.2 Transformations of E and p 

Consider the following one-dimensional situation, where all the motion is along the 
x-axis. A particle has energy E' and momentum p 1 in frame S'. Frame S' moves at 
speed v with respect to frame S, in the positive x-direction (see Fig. 11.6). What 
are E and p in S ? 
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Let u' be the particle’s speed in S'. From the velocity addition formula, the 
particle’s speed in S is (dropping the factors of c) 


(11.14) 


This is all we need to know, because a particle’s velocity determines its energy and 
momentum. But we’ll need to go through a little algebra to make things look pretty. 
The 7 -factor associated with the speed u is 


1 + u'v 


\A-(uS) 


2 \/( 1 - u“)(l - W) 


= 7ti'7u(l + u'v). 


(11.15) 


The energy and momentum in S' are 


E' = "f u /rn. and p' = j u > mu'. 


(11.16) 


while the energy and momentum in S are, using eq. (11.15), 

E = 7 „m = 7 U ' 7„(1 + u'v)m, 

p = 7 U mu = 7 m /7„(1 + u'v)m ^ = lu>lv{u' + v)m. (11.17) 

Using the E' and p' from eq. (11.16), we can rewrite E and p as (with 7 — j v ) 

E = 7 (E' + vp'), 

p = 7 (p'+ vE'). (11.18) 


These are transformations for E and p between frames. If you want to put the 
factors of c back in, then the vE' term becomes vE' /(?. These equations are easy 
to remember, because they look exactly like the Lorentz transformations for the 
coordinates t and x in eq. (10.17). This is no coincidence, as we will see in Chapter 
12 . 

Remark: We can perform a few checks on eqs. ( 11 . 18 ). If u! = 0 (so that p' = 0 and 
E' = m), then E = 7 m and p = jmv, as they should. Also, if u' = — v (so that p' = — 7 mv 
and E' = 7 m), then E = m and p = 0 , as they should. 

Note that since the transformations in eq. (11.18) are linear, they also hold if E 
and p represent the total energy and momentum of a collection of particles. That 
is, 

Y, E =■ e'+ v J2p) > 

E? = t(Ep'+E e ')- ( n - 19 ) 

Indeed, any (corresponding) linear combinations of the energies and momenta are 
valid here, in place of the sums. For example, we can use the combinations (E\ + 
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3E% — 7E*) and {p\ + 3p% — 7p|) in eq. (11.18), where the subscripts indicate which 
particle, and the superscripts indicate before or after a collision. You can verify this 
by simply taking the appropriate linear combination of eqs. (11.18) for the various 
particles. This consequence of linearity is a very important and useful result, as will 
become clear in the remarks below. 

You can use eq. (11.18) to show that 

E 2 -p 2 = E' 2 -p' 2 , (11.20) 

just as we did to obtain the t 2 — x 2 = t' 2 — x' 2 result in eq. (10.37). The E’s and p’s 
here can represent any (corresponding) linear combinations of the E’s and p’s of the 
various particles, due to the linearity of eq. (11.18). For one particle, we already 
know that eq. (11.20) is true, because both sides are equal to m 2 , from eq. (11.10). 
For many particles, the invariant E 2 ota] — p 2 otal is equal to the square of the total 
energy in the CM frame (which reduces to m 2 for one particle), because ptotai = 0 
in the CM frame, by definition. 

Remarks: 

1. In the previous section, we said that we needed to show that if E and p are conserved 
in one reference frame, then they are conserved in any other frame (because a con¬ 
servation law shouldn’t depend on what frame you’re in). Eq. (11.18) quickly gives 
us this result, because the E and p in one frame are linear functions of the E' and p' 
in another frame. If the total A E' and A p' in S' axe zero, then eq. (11.18) says that 
the total A E and Ap in S must also be zero. We have used the fact that A E is a 
linear combination of the E’s, and that Ap is a linear combination of the p’s, so eq. 
(11.18) applies to these linear combinations. 

2. Eq. (11.18) makes it clear that if you accept the fact that p = jmv is conserved in 
all frames, then you must also accept the fact that E = 7 m is conserved in all frames 
(and vice versa). This is true because the second of eqs. (11.18) says that if Ap and 
A p' are both zero, then A E' must also be zero (again, we have used linearity). E and 
p have no choice but to go hand in hand. £ 

Eq. (11.18) applies to the ^-component of the momentum. How do the transverse 
components, p y and p z . transform? Just as with the y and z coordinates in the 
Lorentz transformations, p y and p z do not change between frames. The analysis in 
Chapter 12 makes this obvious, so for now we’ll simply state that 

Py = Py, 

Pz = p' z , (H.21) 

if the relative velocity between the frames is in the ^-direction. If you really want to 
show explicitly that the transverse components do not change between frames, or if 
you are worried that a nonzero speed in the y direction will mess up the relationship 
between p x and E that we calculated in eq. (11.18), then Exercise 4 is for you. But 
it’s a bit tedious, so feel free to settle for the much cleaner reasoning in Chapter 12. 
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11.3 Collisions and decays 

The strategy for studying relativistic collisions is the same as that for studying 
nonrelativistic ones. You simply have to write down all the conservation of energy 
and momentum equations, and then solve for whatever variables you want to solve 
for. The conservation principles are the same as they’ve always been. It’s just that 
now the energy and momentum take the new forms in eq. ( 11 . 1 ). 

In writing down the conservation of energy and momentum equations, it proves 
extremely useful to put E and p together into one four-component vector, 

P = (E,p) = (E,p x ,p y ,p z ). (11.22) 

This is called the energy-momentum 4-vector , or the ^-momentum, for short . 5 Our 
notation in this chapter will be to use an uppercase P to denote a 4-momentum 
and a lowercase p or p to denote a spatial momentum. The components of a 4- 
momentum are generally indexed from 0 to 3, so that Pq = E, and (Pi, P 2 , P 3 ) = p. 
For one particle, we have 

P = ( r ym, r ymv x ,'ymv y , r ymv z ). (11.23) 

The 4-momentum for a collection of particles simply consists of the total E and 
total p of all the particles. 

There are deep reasons for considering this four-component vector (as we will 
see in Chapter 12), but for now we will view it as simply a matter of convenience. If 
nothing else, it helps with the bookkeeping. Conservation of energy and momentum 
in a collision reduce to the concise statement, 

before = ^afterj (11.24) 

where these are the total 4-momenta of all the particles. 

If we define the inner product between two 4-momenta, A = (To, A\. A 2 , A 3 ) and 
B = (Po; Bi, B 2 , P 3 ), to be 

A • B = A 0 B 0 - A 1 P 1 - A 2 P 2 - A 3 B 3 , (11.25) 

then the relation E 2 — p 2 = m 2 (which is true for one particle) may be concisely 
written as 

P 2 = P ■ P = m 2 . (11.26) 

In other words, the square of a particle’s 4-momentum equals the square of its mass. 
This relation will prove to be very useful in collision problems. Note that it is 
frame-independent, as we saw in eq. ( 11 . 20 ). 

This inner product is different from the one we’re used to in three-dimensional 
space. It has one positive sign and three negative signs, in contrast with the usual 
three positive signs. But we are free to define it however we wish, and we did 
indeed pick a good definition, because our inner product is invariant under Lorentz- 
transformations, just as the usual 3-D inner product is invariant under rotations . 6 

5 If we were keeping in the factors of c, then the first term would be E/c, although some people 
instead multiply the p by c. Either convention is fine. 

6 For the inner product of a 4-momentum with itself (which could be any linear combination of 
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Example (Relativistic billiards): A particle with mass to and energy E ap¬ 
proaches an identical particle at rest. They collide (elastically) in such a way that 
they both scatter at an angle 9 relative to the incident direction (see Fig. 11.7). What 
is 9 in terms of E and to? What is 6 in the relativistic and non-relativistic limits? 

Solution: The first thing we should always do is write down the 4-momenta. The 
4-momenta before the collision are 

Pi = (F,p,0,0), P 2 = (to, 0,0,0), (11.27) 

where p = m 2 . The 4-momenta after the collision are (primes now denote 

“after”) 



P[ = (E',p' cos0,p' sin#,!)), P^ = (E',p' cos#, — p' sin#,0), (11.28) 


where p' = \fE' 2 — to 2 . Conservation of energy gives E’ = (E + to)/ 2, and con¬ 
servation of p x gives p' cos 0 = p/2. Therefore, the 4-momenta after the collision 
are 


(11.29) 


From eq. (11.26), the squares of these 4-momenta must be to 2 . Therefore, 


(E + to)' 


-J (1 + tan 2 9) 
l2 (P 2 — to 2 ) 


E 2 + 2 Em — 3 to 2 E + 3 to 


(11.30) 


The relativistic limit is E » to, which yields cos 9 « 1. Therefore, both particles 
scatter almost directly forward. 

The nonrelativistic limit is E ss to (it’s not E « 0), which yields cos 9 « 1 / \J2. 
Therefore, 9 ss 45°, and the particles scatter with a 90° angle between them. This 
agrees with the result from the example in Section 4.7.2, a result which pool players 
are very familiar with. 


Decays are basically the same as collisions. All you have to do is conserve energy 
and momentum, as the following example shows. 


Example (Decay at an angle): A particle with mass M and energy E decays 
into two identical particles. In the lab frame, they are emitted at angles 90° and 9, 
as shown in Fig. 11.8. What are the energies of the created particles? 

We’ll give two solutions. The second one shows how 4-momenta can be used in a very 
clever and time-saving way. 



Figure 11.8 


4-momenta of various particles), this invariance is simply the statement in eq. (11.20). For the 
inner product of two different 4-momenta, we’ll prove the invariance in Section 12.3. 
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First Solution: The 4-momentum before the decay is 

P = (E,p, 0,0), (11.31) 

where p = \ZE 2 — M' 2 . Let the created particles have mass m. The 4-momenta after 
the collision are 

Pl = (Fi,0,pi,0), P 2 = (E 2 ,P2Cose,-p 2 sm6,0). (11.32) 

Conservation of p x immediately gives p 2 cos 9 = p, which then implies that p 2 sin 9 = 
ptan9. Conservation of p y says that the final p y ’s are opposites. Therefore, the 
4-momenta after the collision are 

Pi = (Fi,0,ptan<9,0), P 2 = {E 2 ,p, -ptan9, 0). (11.33) 

Conservation of energy gives E = E\ + E 2 . Writing this in terms of the momenta and 
masses gives 

E = p 2 tan 2 9 +m 2 -|fi^p 2 (l + tan 2 9) + m 2 . (11.34) 

Putting the first radical on the left side, squaring, and solving for that radical (which 
is Pi) gives 


E 2 -p 2 = AP 
1 2 E 2 E ' 

In a similar manner, we find that E 2 equals 

_ E 2 + p 2 _ 2E 2 - M 2 
2 “ 2 E ~ 2 E 

These add up to E, as they should. 


(11.35) 

(11.36) 


Second Solution: With the 4-momenta defined as in eqs. (11.31) and (11.32), 
conservation of energy and momentum can be combined into the statement, P = 
P\ + P 2 . Therefore, 



And then E 2 =E-E l = (2 E 2 - M 2 ) /2 E. 

This solution should convince you that 4-momenta can save you a lot of work. What 
happened here was that the expression for P 2 was fairly messy, but we arranged things 
so that it only appeared in the form of P 2 , which is simply m 2 . 4-momenta provide 
a remarkably organized method for sweeping unwanted garbage under the rug. 
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11.4 Particle-physics units 

A branch of physics that uses relativity as one of its main ingredient is Elementary- 
Particle Physics, which is the study of the building blocks of matter (electrons, 
quarks, neutrinos, etc.). It is unfortunately the case that most of the elementary 
particles we want to study don’t exist naturally in the world. We therefore have 
to create them in particle accelerators by colliding other particles together at very 
high energies. The high speeds involved require the use of relativistic dynamics. 
Newtonian physics is essentially useless. 

What is a typical size of a rest energy, me 2 , of an elementary particle? The rest 
energy of a proton (which isn’t really elementary; it’s made up of quarks, but never 
mind) is 


E p = m p c 2 = (1.67 • 10" 27 kg)(3 • 10 s m/s) 2 = 1.5 • 10" 10 joules. (11.38) 


This is very small, of course. So a joule is probably not the best unit to work with. 
We would get very tired of writing the negative exponents over and over. 

We could perhaps work with “nanojoules”, but particle-physicists like to work 
instead with the “eV”, the electron-volt. This is the amount of energy gained by 
an electron when it passes through a potential of one volt. The electron charge is 
e = 1.6022 • 10 -19 C, and a volt is defined aslV = lJ/C. So the conversion from 
eV to joules is 7 


1 eV = (1.6022 • 10" 19 C)(l J/C) = 1.6022 • 10" 19 J. (11.39) 

Therefore, in terms of eV, the rest-energy of a proton is 938 • 10 6 eV. We now have 
the opposite problem of having a large exponent hanging around. But this is easily 
remedied by the prefix “M”, which stands for “mega”, or “million”. So we finally 
have a proton rest energy of 

E p = 938 MeV. (11.40) 


You can work out for yourself that the electron has a rest-energy of E e = 0.511 
MeV. The rest energies of various particles are listed in the table below. The ones 
preceded by a are the averages of differently charged particles, whose energies 
differ by a few MeV. These (and the many other) elementary particles have spe¬ 
cific properties (spin, charge, etc.), but for the present purposes they need only be 
thought of as point objects having a definite mass. 


7 This is getting a little picky, but “eV” should actually be written as “eV”, because “eV” stands 
for two things that are multiplied together (in contrast with, for example, the “kg” symbol for 
“kilogram”), one of which is the electron charge, which is usually denoted by e. 
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particle 

rest energy (MeV) 

electron (e) 

0.511 

muon (p) 

105.7 

tau (r) 

1784 

proton (p) 

938.3 

neutron (n) 

939.6 

lambda (A) 

1115.6 

sigma (S) 

k 1193 

delta (A) 

m 1232 

pion ( 7 r) 

» 137 

kaon (K) 

«496 


We now come to a slight abuse of language. When particle-physicists talk about 
masses, they say things like, “The mass of a proton is 938 MeV.” This, of course, 
makes no sense, because the units are wrong; a mass can’t equal an energy. But 
what they mean is that if you take this energy and divide it by c 2 , then you get 
the mass. It would truly be a pain to keep saying, “The mass is such-and-such an 
energy, divided by c 2 .” For a quick conversion back to kilograms, you can show that 

1 MeV/c 2 = 1.783 • 10" 30 kg. £11.41) 


11.5 Force 


11.5.1 Force in one dimension 

“Force” is a fairly intuitive concept. It is how hard you push or pull on something. 
We were told long ago that F equals ma, and this makes sense. If you push an 
object in a certain direction, then it accelerates in that direction. But, alas, we’ve 
now outgrown the F = ma definition. It’s time to look at things a different way. 

The force on an object is hereby defined to be the rate of change of momentum 
(we’ll just deal with one-dimensional motion for now), 


F 


dp 

dt 


(11.42) 


This is actually the definition in nonrelativistic physics too, but in that case, where 
p = mv, we obtain F = ma anyway. So it doesn’t matter if we define F to be dp/dt 
or ma. But in the relativistic case, it does matter, because p = 7 mv, and 7 can 
change with time. This will complicate things, and it will turn out that F does not 
equal ma. Why do we define F to be dp/dt instead of ma? One reason is given 
in the first remark below. Another arises from the general 4-vector formalism in 
Chapter 12. 

To see what form the F in eq. (11.42) takes in terms of the acceleration, a, note 


that 


dl _ d / 1 \ 

dt dt V \/l — v 2 J 


vv 

(1 - v 2 ) 3 / 2 


• 7 3 va. 


(11.43) 
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Therefore, assuming that m is constant, we have 

F=^ _ + 

= ma'y^v 2 + 1) 

= 7 3 ma. (11.44) 

This doesn’t look as nice as F = ma, but that’s the way it goes. 

They said, “F is ma, bar none.” 

What they meant wasn’t quite as much fun. 

It’s dp by dt, 

Which just happens to be 
Good ol’ “ma” when 7 is 1. 


Consider now the quantity dE/dx, where E is the energy, E = 7 m. We have 


dE dt^m) 
dx dx 


dx ' 


(11.45) 


But v(dv/dx) = dv/dt = a. Therefore, dE/dx = 7 3 ma, and eq. (11.44) gives 


F 


dE 

dx 


(11.46) 


Note that eqs. (11.42) and (11.46) take exactly the same form as in the nonrela- 
tivistic case. The only new thing in the relativistic case is that the expressions for 
p and E are modified. 

Remarks: 


1. Eq. (11.42) is devoid of any physical content, because all it does is define F. If F 
were instead defined through eq. (11.46), then eq. (11.46) would be devoid of any 
content. The whole point of this section, and the only thing of any substance, is that 
(with the definitions p = 7 mv and E = 7 m) 


dp _ dE 
dt dx 


(11.47) 


This is the physically meaningful statement. If we then want to label both sides of 
the equation with the letter F for “force,” so be it. But “force” is simply a name. 

2. The result in eq. (11.46) suggests another way to arrive at the E = 7 m relation. The 
reasoning is exactly the same as in the nonrelativistic derivation of energy conservation 
in Section 4.1. Define F, as we have done, through eq. (11.42). Then integrate eq. 
(11.44) from x\ to X 2 to obtain 


/> - /> 
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(11.48) 


where we have used eq. (11.46). If we then define the “potential energy” as 

V(x) = - j F(x) dx, (11.49) 

where Xo is an arbitrary reference point, we obtain 

V(xi) + ymj = V(x 2 ) + qmj . (11.50) 

We see that the quantity V +7 m is independent of x. It is therefore worthy of a name, 
and we use the name “energy” due to the similarity with the Newtonian result . 8 
The work-energy theorem (that is, / F dx = A E) holds in relativistic physics, just 
as it does in the nonrelativistic case. The only difference is that E is 7 m instead of 
mv 2 / 2. * 


11.5.2 Force in two dimensions 

In two dimensions, the concept of force becomes a little strange. In particular, as 
we will see, the acceleration of an object need not point in the same direction as the 
force. We start with the definition, 


F = 


dp 

dt 


(11.51) 


This is a vector equation. Without loss of generality, let us deal with only two 
spatial dimensions. Consider a particle moving in the x-direction, and let us apply 
a force, F = (F x , F y ). The particle’s momentum is 



Taking the derivative of this, and using the fact that v y is initially zero, we obtain 



8 Actually, this derivation only suggests that E is given by 7 m up to an additive constant. For 
all we know, E might take the form, E = jm — m, which would make the energy of a motionless 
particle equal to zero. An argument along the lines of Section 11.1.2 is required to show that the 
additive constant is zero. 
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= m( 7 3 a a; , 7 a 2/ ). (11.53) 

We see that this is not proportional to (a x , a y ). The first component agrees with eq. 
(11.44), but the second component has only one factor of 7 . The difference comes 
from the fact that 7 has a first-order change if v x changes, but not if v y changes, 
assuming that v y is initially zero. The particle therefore responds differently to 
forces in the x- and y-directions. It is easier to accelerate something in the transverse 
direction. 


11.5.3 Transformation of forces 

Let a force act on a particle. How are the components of the force in the particle’s 
frame, S', related to the components of the force in another frame, S '? 9 Let the 
relative motion be along the x- and x y -axes, as in Fig. 11.9. In frame S, eq. (11.53) 
says 

(F x , F y ) = m(-f 3 a x ,-fa y ). (11.54) 

And in frame S', the 7 factor for the particle equals 1 , so eq. (11.53) reduces to the 
usual expression, 

(F' x ,F' y ) = m(a' x ,a' y ). (11.55) 

Let’s now try to relate these two forces, by writing the primed accelerations on the 
right-hand side of eq. (11.55) in terms of the unprimed accelerations. 

First, we have a! y = j 2 a y . This is true because transverse distances are the same 
in the two frames, but times are shorter in S' by a factor 7 . That is, dt' = dt/ 7 . We 
have indeed put the 7 in the right place here, because the particle is essentially at rest 
in S', so the usual time dilation holds. Therefore, a' y = d 2 y'/dt' 2 = d 2 y/(dt/j) 2 = 
l 2(X y 

Second, we have a' x = 7 3 a x . In short, this is true because time dilation brings 
in two factors of 7 (as in the a y case), and length contraction brings in one. In a 
little more detail: Let the particle move from one point to another in frame S' , as it 
accelerates from rest in S'. Mark these two points, which are a distance a' x (dt') 2 / 2 
apart, in S'. As S' flies past S, the distance between the two marks will be length 
contracted by a factor 7 , as viewed by S. This distance (which is the excess distance 
the particle has over what it would have had if there were no acceleration) is what 
S calls a x (dt) 2 / 2. Therefore, 

^ a x dt 2 = ^ Q<4 dt ,2S j => a' x = ya x (Jj^j = -/ 3 a x . (11.56) 

9 To be more precise, S' is the instantaneous inertial frame of the particle. Once the force is 
applied, the particle’s frame will no longer be S' . But for a very small elapsed time, the particle 
will still essentially be in S'. 



Figure 11.9 
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Eq. (11.55) may now be written as 


(. F' x ,F ' y ) = m(T A a x , r ) 2 a y ). 

Finally, comparing eqs. (11.54) and (11.57), we find 

. F' 

F x = F and F v = v 
7 


(11.57) 


(11.58) 


We see that the longitudinal force is the same in the two frames, but the transverse 
force is larger by a factor of 7 in the particle’s frame. 

Remarks: 

1. What if someone comes along and relabels the primed and unprimed frames in eq. 
(11.58), and concludes that the transverse force is smaller in the particle’s frame? He 
certainly can’t be correct, given that eq. (11.58) is true, but where is the error? 

The error lies in the fact that we (correctly) used dt' = dt/'y above, because this is 
the relevant expression concerning two events along the particle’s worldline. We are 
interested in two such events, because we want to see how the particle moves. The 
inverted expression, dt = dt'/'y, deals with two events located at the same position in 
S, and therefore has nothing to do with the situation at hand. Similar reasoning holds 
for the relation between dx and dx'. There is indeed one frame here that is special 
among all the possible frames, namely the particle’s instantaneous inertial frame. 

2. If you want to compare forces in two frames, neither of which is the particle’s rest 
frame, then just use eq. (11.58) twice and relate each of the forces to the rest- 
frame forces. It quickly follows that for another frame S", we have F” = F x , and 
7 "Fy = 7 F y , where the 7 ’s are measured relative to the rest fame, S'. £ 



"V 5 


Figure 11.10 



Figure 11.11 


Example (Bead on a rod): A spring with a tension has one end attached to the 
end of a rod, and the other end attached to a bead which is constrained to move along 
the rod. The rod makes an angle 9' with respect to the x'-axis, and is fixed at rest in 
the S' frame (see Fig. 11.10). The bead is released and is pulled along the rod. 
When the bead is released, what does the situation look like in the frame, S, of 
someone moving to the left at speed v? In answering this, draw the directions of 

(a) the rod, 

(b) the acceleration of the bead, and 

(c) the force on the bead. 

In frame S, does the wire exert a force of constraint? 

Solution: In frame S: 

(a) The horizontal span of the rod is decreased by a factor 7 , due to length contrac¬ 
tion, and the vertical span is unchanged, so we have tan 0 = 7 tan O', as shown 
in Fig. 11.11. 
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(b) The acceleration must point along the rod, because the bead always lies on 
the rod. Quantitatively, the position of the bead in frame S takes the form of 
( x,y ) = (vt — a x t 2 /2,—a.yt 2 /2), by the definition of acceleration. The position 
relative to the starting point on the rod, which has coordinates (vt, 0 ), is then 
(Ax, Ay) = (—a x t 2 /2, —a y t 2 / 2). The condition for the bead to stay on the rod 
is that the ratio of these coordinates be equal the slope of the rod in Frame S. 
Therefore, a y /a x = tan 9, so the acceleration points along the rod. 

(c) The y-component of the force on the bead is decreased by a factor 7 , by eq. 

(11.58), so we have tan^ = ( 1 / 7 ) tan#', as shown in the figure. 

As a double-check that a does indeed point along the rod, we can use eq. (11.53) to 
write a y /a x = 7 2 F y /F x . Then eq. (11.58) gives a y /a x = 7 F' y IF' x = qtan#' = tan 9, 
which is the direction of the wire. 

The wire does not exert a force of constraint. The bead need not touch the wire in 
S', so it need not touch it in S. Basically, there is no need to have an extra force to 
combine with F to make the result point along a, because F simply does not have to 
be collinear with a. 


11.6 Rocket motion 

Up to this point, we have dealt with situations where the masses of our particles are 
constant, or where they change abruptly (as in a decay, where the sum of the masses 
of the products is less than the mass of the initial particle). But in many problems, 
the mass of an object changes continuously. A rocket is the classic example of this 
type of situation. Hence, we will use the term “rocket motion” to describe the 
general class of problems where the mass changes continuously. 

The relativistic rocket itself encompasses all of the important ideas, so let’s study 
that example here. Many more examples are left for the problems. We’ll present 
three solutions to the rocket problem, the last of which is rather slick. In the end, 
the solutions are all basically the same, but it should be helpful to see the various 
ways of looking at the problem. 


Example (Relativistic rocket): Assume that a rocket propels itself by continually 
converting mass into photons and firing them out the back. Let m be the instanta¬ 
neous mass of the rocket, and let v be the instantaneous speed with respect to the 
ground. Show that 


dv 


= 0 . 


(11.59) 


If the initial mass is M, and the initial v is zero, integrate eq. (11.59) to obtain 



(11.60) 


First solution: The strategy of this solution will be to use conservation of momen¬ 
tum in the ground frame. 
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Consider the effect of a small mass being converted into photons. The mass of the 
rocket goes from to to m + dm (where dm is negative). So in the frame of the rocket, 
photons with total energy E r = —dm (which is positive) are fired out the back. In 
the frame of the rocket, these photons have momentum p r = dm (which is negative). 
Let the rocket move with speed v with respect to the ground. Then the momentum 
of the photons in the ground frame, p g , may be found via the Lorentz transformation, 

p g = 7 {p r + vE r ) = y(dm + v(-dm)) = 7(1 - v) dm. (11.61) 

This is still negative, of course. 

Remark: A common error is to say that the converted mass (—dm) takes the form of 
photons of energy (—dm) in the ground frame. This is incorrect, because although the 
photons have energy (—dm) in the rocket frame, they are redshifted (due to the Doppler 
effect) in the ground frame. From eq. (10.48), we see that the frequency (and hence the 
energy) of the photons decreases by a factor of y/(l — v)/(l + v) when going from the rocket 
frame to the ground frame. This factor equals the 7(1 — v) factor in eq. (11.61). £ 


We may now use conservation of momentum in the ground frame to say that 
(TO 7 u) 0 id = 7(1 — v) dm + (myv) new ==> 7(1 — v) dm + d(myv) = 0 . (11.62) 


The d(myv) term may be expanded to give 


d(m yv) 


(dm)yv + m(dy)v + my(dv) 
yv dm + to( y 3 v dv)v + m'y dv 
yv dm + my(y 2 v 2 + 1 ) dv 
yv dm + m'y 3 dv. 


Therefore, eq. (11.62) gives 

0 = 7(1 — v) dm + yv dm + my 3 dv 
= ydm + my 3 dv. 


(11.63) 


(11.64) 


Hence, 


dv 


= 0, 


(11.65) 


in agreement with eq. (11.59). We must now integrate this. With the given initial 
values, we have 


r m dm I v a 

Jm m Jo 1 - 


( 11 . 66 ) 

We could simply look up the dv integral in a table, but let’s do it from scratch . 10 
Writing 1/(1 — v 2 ) as the sum of two fractions gives 



= J f ln(l + v) — ln(l — v)) P 

= < n - 67 > 

10 Tables often list the integral of 1/(1 — v 2 ) as tanh -1 (v), which you can show is equivalent to 
the result in eq. (11.67). 
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Eq. (11.66) therefore gives 



( 11 . 68 ) 


in agreement with eq. (11.60). This result is independent of the rate at which the 
mass is converted into photons. It is also independent of the frequency of the emitted 
photons. Only the total mass expelled matters. 

Note that eq. (11.68) quickly tells us that the energy of the rocket, as a function of 
velocity, is 

E = 'ym = 7 M \Jj^ = ■ (11.69) 

This has the interesting property of approaching M/2 as v —> c. In other words, 
half of the initial energy remains with the rocket, and half ends up as photons (see 
Exercise 18). 


Remark: From eq. (11.61), or from the previous remark, we see that the ratio of the energy 
of the photons in the ground frame to that in the rocket frame is ^/(l — v)/(l + v). This 
factor is the same as the factor in eq. (11.68). In other words, the photons’ energy decreases 
in exactly the same manner as the mass of the rocket (assuming that the photons are ejected 
with the same frequency in the rocket frame throughout the process). Therefore, in the 
ground frame, the ratio of the photons’ energy to the mass of the rocket doesn’t change with 
time. There must be a nice intuitive explanation for this, but it eludes me. & 


Second solution: The strategy of this solution will be to use F = dp/dt in the 
ground frame. 

Let r denote the time in the rocket frame. Then in the rocket frame, dm/dr is the 
rate at which the mass of the rocket decreases and is converted into photons (dm 
is negative). The photons therefore acquire momentum at the rate dp/dr = dm/dr 
in the rocket frame. Since force is the rate of change of momentum, we see that a 
force of dm/dr pushes the photons backward, and an equal and opposite force of 
F = —dm/dr pushes the rocket forward in the rocket frame. 

Now go to the ground frame. We know from eq. (11.58) that the longitudinal force is 
the same in both frames, so F = —dm/dr is also the force on the rocket in the ground 
frame. And since t = 7 r, where t is the time on the ground (the photon emissions 
occur at the same place in the rocket frame, so we have indeed put the time-dilation 
factor of 7 in the right place), we have 



Remark: We can also calculate the force on the rocket by working entirely in the ground 
frame. Consider a mass (—dm) that is converted into photons. Initially, this mass is traveling 
along with the rocket, so it has momentum (—dm)'yv. After it is converted into photons, it has 
momentum 7(1 — 1 ;) dm (from the first solution above). The change in momentum is therefore 
7(1 — v) dm — (—dm)jv = 7 dm. Since force is the rate of change of momentum, a force of 
7 dm/dt pushes the photons backwards, and an equal and opposite force of F = —7 dm/dl 
therefore pushes the rocket forwards. & 
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Now things get a little tricky. It is tempting to write down F = dp/dt = d(m r yv)/dt = 
(dm/dt)^v + md('yv)/dt. This, however, is not correct, because the dm/dt term is 
not relevant here. When the force is applied to the rocket at an instant when the 
rocket has mass m, the only thing the force cares about is that the mass of the rocket 
at the given instant is m. It doesn’t care that m is changing . 11 Therefore, the correct 
expression we want is 


F=m ^kA 

dt 


(11.71) 


As in the first solution above, or in eq. (11.44), we have d(jv)/dt = 7 3 dv/dt. Using 
the F from eq. (11.70), we arrive at 


dm 

7 lit - 


,3 dv 

dt ’ 


which is equivalent to eq. (11.64). The solution proceeds as above. 


(11.72) 


Third solution: The strategy of this solution will be to use conservation of energy 
and momentum in the ground frame, in a slick way. 

Consider a clump of photons fired out the back. The energy and momentum of these 
photons are equal in magnitude and opposite in sign (with the convention that the 
photons are fired in the negative direction). By conservation of energy and momen¬ 
tum, the same statement must be true about the changes in energy and momentum 
of the rocket. That is, 


d( 7 m) = —d^mv) => d(jm + jmv) = 0. (11.73) 

Therefore, 7 m(l + v) is a constant. We are given that m = M when v = 0. Hence, 
the constant must be M. Therefore, 


7 m(l + v) = M 



(11.74) 


Now, that’s a quick solution, if there ever was one! 


11.7 Relativistic strings 

Consider a “massless” string with a tension that is constant (that is, independent 
of length). 12 We will call such objects relativistic strings, and we will study them 
for two reasons. First, these strings, or reasonable approximations thereof, actually 
do occur in nature. For example, the gluon force which holds quarks together is 
approximately constant over distance. And second, they open the door to a whole 
new supply of problems we can solve, like the following one. 


n Said in a different way, the momentum associated with the missing mass still exists. It’s just 
that it’s not part of the rocket anymore. This issue is expanded on in Appendix E. 

12 By “massless,” we mean that the string has no mass in its unstretched (that is, zero-length) 
state. Once it is stretched, it will have energy, and hence mass. 
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Example (Mass connected to a wall): A mass to is connected to a wall by a 
relativistic string with tension T. The mass starts next to the wall and has initial 
speed v away from it (see Fig. 11.12). What is the maximum distance from the wall 
the mass achieves? How much time does it take to reach this point? 

Solution: Let I be the maximum distance from the wall. The initial energy of 
the mass is E = 7 m. The final energy at x = I is simply m, because the mass is 
instantaneously at rest there. Integrating F = dE/dx, and using the fact that the 
force always equals — T, gives 

FAx = A E => {-T)I =m-ym => 1= m ^~ ^ . (11.75) 

Let t be the time it takes to reach this point. The initial momentum of the mass is 
p = 7 mv. Integrating F = dp/dt, and using the fact that the force always equals — T, 
gives 

FAt = Ap => (~T)t = 0 — 77 m; ==> t = _ (11.76) 

Note that we cannot use F = ma to do this problem. F does not equal ma. It equals 
dp/dt (and also dE/dx). 


Relativistic strings may seem a bit strange, but there is nothing more to solving 
a one-dimensional problem than the two equations, 

F = , and F =^. (11.77) 

dt' dx v ’ 


Example (Where the masses meet): A relativistic string of length I and tension 
T connects a mass m and a mass M (see Fig. 11.13). The masses are released from 
rest. Where do they meet? 


Solution: Let the masses meet at a distance x from the initial position of to. At this 
meeting point, F = dE/dx tells us that th e energy of to is m + Tx, and the energy of 
M is M + T(£ — x). Using p — to 2 we see that the magnitudes of the momenta 

at the meeting point are 

p m = y/{m + Tx) 2 - to 2 and p M = V(M + T(£ - x)) 2 - M 2 . (11.78) 


But F = dp/dt then tells us that these must be equal, because the same force (in 
magnitude, but opposite in direction) acts on the two masses for the same time. 
Equating the above p’s gives 


£(T(t/2) + M) 
M + m + Tl 


(11.79) 


This is reassuring, because the answer is simply the location of the initial center of 
mass, with the string being treated (quite correctly) like a stick of length £ and mass 
Tt (divided by c 2 ). 



Figure 11.12 


/ 

. y 

Figure 11.13 
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Remark: Let’s check a few limits. In the limit of large T or i (more precisely, in the limit 
T£ Me 2 and T£ me 2 ), we have x = If 2. This makes sense, because in this case the 
masses are negligible and therefore both move at essentially speed c, and hence meet in the 
middle. In the limit of small T or £ (more precisely, in the limit T£ <C Me 2 and T£ <C me 2 ), 
we have x = M£/(M + m), which is simply the Newtonian result for an everyday-strength 
spring. & 


11.8 Mass 

Some treatments of relativity refer to the mass of a motionless particle as the “rest- 
mass” mo, and the mass of moving particle as the “relativistic mass” m re i = ymo• 
This terminology is misleading and should be avoided. There is no such thing as 
“relativistic mass.” There is only one “mass” associated with an object. This mass 
is what the above treatments would call the “rest mass .” 13 And since there is only 
one type of mass, there is no need to use the qualifier “rest” or the subscript “ 0 .” 
We therefore simply use the notation “m.” In this section, we will explain why 
“relativistic mass” is not a good concept to use . 14 

Why might someone want to call m re i = 7 m the mass of a moving particle? The 
basic reason is that the momentum takes the nice Newtonian form of p = m re iv. 
The tacit assumption here is that the goal is to assign a mass to the particle such 
that all the Newtonian expressions continue to hold, with the only change being a 
modified mass. That is, we want our particle to act in exactly the same way that a 
particle of mass 7 m would, according to our everyday intuition . 15 

If we insist on hanging onto our Newtonian rules, let’s see what they imply. If 
we want our particle to act as a mass 7 m does, then we must have F = ( 7 m)a. 
However, we saw in Section 11.5.2 that although this equation is true for transverse 
forces, it is not true for longitudinal forces. The 7 m would have to be replaced by 
7 3 m for a longitudinal force. As far as acceleration goes, a mass reacts differently 
to forces that point in different directions. We therefore see that it is impossible to 
assign a unique mass to a moving particle, such that it behaves in a Newtonian way 
under all circumstances. Not only is the goal of thinking of things in a Newtonian 
way ill-advised, it is doomed to failure. 

13 For example, the mass of an electron is 9.11 • 10 -31 kg, and the mass of a liter of water is 1 kg, 
independent of the speed. 

14 Of course, you can define the quantity 7 m with any name you want. You can call it “relativistic 
mass,” or you can call it “pumpkin pie.” The point is that the connotations associated with these 
definitions will mislead you into thinking certain things are true when they are not. The quantity 
7 m does not behave as you might want a mass to behave (as we will show). And it also doesn’t 
make for a good dessert. 

ls This goal should send up a red flag. It is similar to trying to think about quantum mechanics 
in terms of classical mechanics. It simply cannot be done. All analogies will eventually break down 
and lead to incorrect conclusions. It is quite silly to try to think about a (more) correct theory 
(relativity or quantum mechanics) in terms of an incorrect theory (classical mechanics), simply 
because our intuition (which is limited and incorrect) is based on the latter. 
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“Force is my a times my ‘mass’,’’ 

Said the driver, when starting to pass. 

But from what we’ve just learned, 

He was right when he turned, 

But wrong when he stepped on the gas. 

The above argument closes the case on this subject, but there are a few other 
arguments that show why it is not good to think of 7 m as a mass. 

The word “mass” is used to describe what is on the right-hand side of the 
equation, E 2 — |p | 2 = m 2 . The m 2 here is an invariant, that is, it is something that 
is independent of the frame of reference. E and the components of p, on the other 
hand, are components of a 4-vector. They depend on the frame. If “mass” is to be 
used in this definite way to describe an invariant, then it doesn’t make sense to also 
use it to describe the quantity 7 m, which is frame-dependent. And besides, there is 
certainly no need to give 7 m another name. It already goes by the name up 
to factors of c. 

It is often claimed that 7 m is the “mass” that appears in the expression for 
gravitational force. If this were true, then it might be reasonable to use “mass” as 
a label for the quantity 7 m. But, in fact, it is not true. The gravitational force 
depends in a somewhat complicated way on the motion of the particle. For example, 
the force depends on whether the particle is moving longitudinally or transversely 
to the source. We cannot demonstrate this fact here, but suffice it to say that if one 
insists on using the naive force law, F = Gmim%/r 2 , then it is impossible to label 
the particle with a unique mass. 
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11.9 Exercises 

Section 11.2: Transformations of E and p 

1. Energy of two masses * 

Two masses M move at speed V. one to the east and one to the west. What 
is the total energy of the system? 

Now consider the setup as viewed from a frame moving to the west at speed 
u. Find the energy of each mass in this frame. Is the total energy larger or 
smaller than the total energy in the lab frame? 

2. System of particles * 

Given ptotai and E tot/a ] for a system of particles, use a Lorentz transformation 
to find the velocity of the CM. More precisely, find the speed of the frame in 
which the total momentum is zero. 

3. CM frame ** 

A mass m travels at speed 3c/5, and another mass m sits at rest. 

(a) Find the energy and momentum of the two particles in the lab frame. 

(b) Find the speed of the CM of the system, by using a velocity-addition 
argument. 

(c) Find the energy and momentum of the two particles in the CM frame. 

(d) Verify that the E's and p’s are related by the relevant Lorentz transfor¬ 
mations. 

(e) Verify that E t 2 otal — pf otal is the same in both frames. 

4. Transformation for 2-D motion ** 

A particle has velocity (u' x ,u y ) in frame S'. which travels at speed v in the 
x-direction relative to frame S. Use the velocity addition formulas in Section 
10.3.3 (eqs. (10.33) and (10.35)) to show that E and p x transform according 
to eq. (11.18), and also that p y = p' y . 

Hint This gets a bit messy, but the main thing you need to show is 

7u = Tu'Tv{l + u 'x v )i where u = iju^ + u* and v! = + u'(j (11.80) 

are the speeds in the two frames. 

Section 11.3: Collisions and decays 

5. Photon, mass collision * 

A photon with energy E collides with a stationary mass m. The combine to 
form one particle. What is the mass of this particle? What is its speed? 

6. A decay * 

A mass M decays into a mass m and a photon. If the speed of rn is v. find rn 
and also the energy of the photon (in terms of M and v). 
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7. Three photons * 

A mass m travels with speed v. It decays into three photons, one of which 
travels in the forward direction, and the other two of which move at angles of 
120° (in the lab frame) as shown in Fig. 11.14. What are the energies of these 
three photons? 



Figure 11.14 


8. Perpendicular photon * 

A photon with energy E collides with a mass M. The mass M scatters off 
at an angle. If the resulting photon moves perpendicularly to the incident 
photon’s direction, as shown in Fig. 11.15, what is its energy? 

9. Another perpendicular photon ** 

A mass m moving with speed 4c/5 collides with another mass m at rest. The 
collision produces a photon with energy E traveling perpendicularly to the 
original direction, and a mass M traveling in another direction, as shown in 
Fig. 11.16. In terms of E and m, what is Ml What is the largest value of E 
(in terms of m) for which this setup is possible? 

10. Colliding diagonally * 

A mass m moving northeastward at speed 4c/5 collides with a photon moving 
southeastward. The result of the collision is one particle of mass M moving 
eastward, as shown in Fig. 11.17 Find the energy of the photon, the mass M, 
and the speed of M. (Give the first two of these answers in terms of m.) 

11. Decay into photons * 

A mass M traveling at 3c/5 decays into a mass M/4 and two photons. One 
photon moves perpendicularly to the original direction, the other photon moves 
off at an angle 9, and the mass M/4 is at rest, as shown in Fig. 11.18. What 
is 91 


E M 


4 



• M 


Figure 11.15 


4c 

5 





• M 


12. Three masses colliding * 

Three masses m, all traveling at speed v = 4c/5, collide at the origin and 
produce a particle of mass M. The three original velocities are in the northeast, 
north, and northwest directions. Find M and its velocity. 

13. Maximum mass * 

A photon and a mass m move in opposite directions. They collide head-on 
and create a new particle. If the total energy of the system is E, how should 
it be divided between the photon and the mass m, so that the mass of the 
resulting particle is as large as possible? 


Figure 11.16 



Figure 11.17 



Figure 11.18 
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Figure 11.19 


Section 11.4: Particle-physics units 

14. Pion-muon race * 

A pion and a muon each have energy 10 GeV. They have a 100 m race. By 
how much distance does the muon win? 

Section 11.5: Force 

15. Force and a collision * 

Two identical masses m are at rest, a distance x apart. A constant force F 
accelerates one of them towards the other until they collide and stick together. 
How much time does this take? What is the mass of the resulting particle? 

16. Pushing on a mass ** 

A mass m starts at rest. You push on it with a constant force F. 

(a) How much time, t, does it take to move the mass a distance x ? (Both t 
and x here are measured in the lab frame.) 

(b) After a very long time, the speed of m will approach the speed of light. 
Therefore, after a very long time, m will remain (approximately) a con¬ 
stant distance (as measured in the lab frame) behind a photon that was 
emitted at t = 0 from the starting position of m. Show that this distance 
equals mc 2 /F. 

17. Momentum paradox *** 

Two equal masses are connected by a massless string with tension T. The 
masses are constrained to move with speed v along parallel lines, as shown 
in Fig. 11.19. The constraints are then removed, and the masses are drawn 
together. They collide and make one blob which continues to move to the 
right. Is the following reasoning correct? If your answer is “no”, state what is 
invalid about whichever of the four sentences is/are invalid. 

“The forces on the masses point in the y-direction. Therefore, there is no 
change in momentum in the x-direction. But the mass of the resulting blob 
is greater than the sum of the initial masses (because they collided with some 
relative speed). Therefore, the speed of the resulting blob must be less than v 
(to keep p x constant), so the whole apparatus slows down in the x-direction.” 

Section 11.6: Rocket motion 

18. Rocket energy ** 

As mentioned at the end of the first solution to the rocket problem in Section 
11.6, the energy of the rocket in the ground frame equals M/( 1 + v). Derive 
this result again, by integrating up the amount of energy that the photons 
have in the ground frame. 
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Section 11.7: Relativistic strings 

19. Two masses * 

A mass m is placed right in front of an identical one. They are connected by a 
relativistic string with tension T. The front one suddenly acquires a speed of 
3c/5. How far from the starting point will the masses collide with each other? 
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Figure 11.20 


11.10 Problems 


Section 11.1: Energy and momentum 

1. Deriving E and p ** 

Accepting the facts that the energy and momentum of a photon are E = hu 
and p = hv/c (where v is the frequency of the light wave, and h is Planck’s 
constant), derive the relativistic formulas for the energy and momentum of a 
massive particle, E = "/me 2 and p = jmv. Hint: Consider a mass m that 
decays into two photons. Look at this decay both in the rest frame of the 
mass, and in a frame where the mass has speed v. You’ll need to use the 
Doppler effect. 


Section 11.3: Collisions and decays 

2. Colliding photons 

Two photons each have energy E. They collide at an angle 6 and create a 
particle of mass M. What is Ml 

3. Increase in mass 

A large mass M, moving with speed V. collides and sticks to a small mass 
m, initially at rest. What is the mass of the resulting object? Work in the 
approximation where M m. 

4. Compton scattering ** 

A photon collides with a stationary electron. If the photon scatters at an angle 
6 (see Fig. 11.20), show that the resulting wavelength, A', is given in terms of 
the original wavelength, A, by 

A' = A+ — (1 -cos0), (11.81) 

me 

where m is the mass of the electron. Note: The energy of a photon is E = 
hv = hc/\. 


5. Bouncing backwards ** 

A ball of mass M and energy E collides head-on elastically with a stationary 
ball of mass m. Show that the final energy of mass M is 


2 mM 2 + E(m 2 + M 2 ) 
m 2 + M 2 + 2 Em 


(11.82) 


Hint: This problem is a little messy, but you can save yourself a lot of trouble 
by noting that E' = E must be a root of an equation you get for E'. (Why?) 


6. Two-body decay * 

A mass Ma decays into masses Mb and Me- What are the energies of Mb 
and Mcl What are their momenta? 
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7. Threshold energy * 

A particle of mass m and energy E collides with an identical stationary par¬ 
ticle. What is the threshold energy for a final state containing N particles of 
mass m? (“Threshold energy” is the minimum energy for which the process 
occurs.) 

Section 11.5: Force 

8. Relativistic harmonic oscillator ** 

A particle of mass m moves along the x-axis under a force F = —mou 2 x. The 
amplitude is b. Show that the period is given by 

T =~ t ~r^M dx ' ( 1L83 ) 

C Jo V7 2 - 1 

where 

7=l + ^(& 2 -* 2 ) 


9. System of masses ** 

Consider a dumbbell made of two equal masses, m. The dumbbell spins ! 

around, with its center pivoted at the end of a stick (see Fig. 11.21). If 1 _ n l... 

the speed of the masses is v, then the energy of the system is 2'jm. Treated Figure 11.21 

as a whole, the system is at rest. Therefore, the mass of the system must be 
2q m. (Imagine enclosing it in a box, so that you can’t see what is going on 
inside.) 

Convince yourself that the system does indeed behave like a mass of M = 2'jm , 
by pushing on the stick (when the dumbbell is in the “transverse” position 
shown in the figure) and showing that F = dp/dt = Ma. 

Section 11.6: Rocket motion 

10. Relativistic rocket ** 

Consider the relativistic rocket from Section 11.6. Let mass be converted to 
photons at a rate a in the rest frame of the rocket. Find the time, t, in the 
ground frame as a function of v. Ui (Alas, it is not possible to invert this, to 
get v as a function of t.) 

11. Relativistic dustpan I * 

A dustpan of mass M is given an initial relativistic speed. It gathers up dust 
with mass density A per unit length on the floor (as measured in the lab frame). 

At the instant the speed is v, find the rate (as measured in the lab frame) at 
which the mass of the dustpan-plus-dust-inside system is increasing. 

16 This involves a slightly tricky integral. Pick your favorite method - pencil, book, or computer. 


(11.84) 
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T 

Figure 11.22 



Figure 11.23 


12. Relativistic dustpan II ** 

Consider the setup in Problem 11. If the initial speed of the dustpan is V, 
what are v(x), v(t). and x(t)7 All quantities here are measured with respect 
to the lab frame. 

13. Relativistic dustpan III ** 

Consider the setup in Problem 11. Calculate, in both the dustpan frame and 
lab frame, the force on the dustpan-plus-dust-inside system (due to the newly 
acquired dust particles smashing into it) as a function of v, and show that the 
results are equal. 

14. Relativistic cart I **** 

A long cart moves at relativistic speed v. Sand is dropped into the cart at a 
rate dm/dt = a in the ground frame. Assume that you stand on the ground 
next to where the sand falls in, and you push on the cart to keep it moving 
at constant speed v. What is the force between your feet and the ground? 
Calculate this force in both the ground frame (your frame) and the cart frame, 
and show that the results are equal. 

15. Relativistic cart II **** 

A long cart moves at relativistic speed v. Sand is dropped into the cart at a 
rate dm/dt = a in the ground frame. Assume that you grab the front of the 
cart and pull on it to keep it moving at constant speed v (while running with 
it). What force does your hand apply to the cart? (Assume that the cart is 
made of the most rigid material possible.) Calculate this force in both the 
ground frame and the cart frame (your frame), and show that the results are 
equal. 

Section 11.6: Relativistic strings 

16. Different frames ** 

(a) Two masses m are connected by a string of length l and constant ten¬ 
sion T. The masses are released simultaneously. They collide and stick 
together. What is the mass, M, of the resulting blob? 

(b) Consider this scenario from the point of view of a frame moving to the 
left with speed v (see Fig. 11.22). The energy of the resulting blob 
must be 7 Me 2 , from part (a). Show that you obtain this same result by 
computing the work done on the two masses. 

17. Splitting mass ** 

A massless string with constant tension T has one end attached to a wall and 
the other end attached to a mass M. The initial length of the string is I (see 
Fig. 11.23). The mass is released. Halfway to the wall, the back half of the 
mass breaks away from the front half (with zero initial relative speed). What 
is the total time it takes the front half to reach the wall? 
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18. Relativistic leaky bucket *** 

The mass M in Problem 17 is replaced by a massless bucket containing an ini¬ 
tial mass M of sand (see Fig. 11.24). On the way to the wall, the bucket leaks 
sand at a rate dm/dx = M/t , where m denotes the mass at later positions. 
(Note that the rate is constant with respect to distance, not time.) 

(a) What is the energy of the bucket, as a function of distance from the wall? 
What is its maximum value? What is the maximum value of the kinetic 
energy? 

(b) What is the momentum of the bucket, as a function of distance from the 
wall? Where is it maximum? 

19. Relativistic bucket *** 

(a) A massless string with constant tension T has one end attached to a wall 
and the other end attached to a mass m. The initial length of the string 
is £ (see Fig. 11.25). The mass is released. How long does it take to 
reach the wall? 

(b) Let the string now have length 2£, with a mass m on the end. Let another 
mass m be positioned next to the £ mark on the string, but not touching 
the string (see Fig. 11.26). The right mass is released. It heads toward 
the wall (while the other mass is still motionless), and then sticks to the 
other mass to make one large blob, which then heads toward the wall. 17 
How much time does this whole process take? 18 

(c) Let there now be N masses and a string of length N£ (see Fig. 11.27). 
How much time does this whole process take? 

(d) Consider now a massless bucket at the end of the string (of length L) 
which gathers up a continuous stream of sand (of total mass M), as it 
gets pulled to the wall (see Fig. 11.28). How much time does this whole 
process take? What is the mass of the contents of the bucket right before 
it hits the wall? 


I _ 

T 


Figure 11.24 


/ _ m 

T 

Figure 11.25 


m 


Figure 11.26 


Figure 11.27 



Figure 11.28 


17 The left mass could actually be attached to the string, and we would still have the same 
situation. The mass wouldn’t move during the first part of the process, because there would be 
equal tensions T on both sides of it. 

18 You can do this in various ways, but one method that generalizes nicely for the next part is to 
show that A(p 2 ) = (E% — E J) + (1?| — E$), where the energies of the moving object (that is, the 
initial m or the resulting blob) are: E\ right at the start, E2 just before the collision, E3 just after 
the collision, and E4 right before the wall. Note that this method does not require knowledge of 
the mass of the blob (which is not 2m). 
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11.11 Solutions 


y 



Figure 11.29 


1. Deriving E and p 

We’ll derive the energy formula, E = 7 me 2 , first. Let the given mass decay into two 
photons, and let Eq be the energy of the mass in its rest frame. Then each of the 
resulting photons has energy Eo/2 in this frame. 

Now look at the decay in a frame where the mass moves at speed v. From eq. (10.48), 
the frequencies of the photons are Doppler-shifted by the factors v) and 

,T|l ~ v ). Since the photons’ energies are given by E = hv, their energies 

are shifted by these same factors, relative to the Eo/2 value in the original frame. 
Conservation of energy then says that in the moving frame, the mass (which is moving 
at speed v) has energy 


E = 


Eo / !+«%-> 

2 V 1-v + 


Eo j 1-v 

TV 1 - v 


iEq. 


(11.85) 


We therefore see that a moving mass has an energy that is 7 times its rest energy. 
We will now use the correspondence principle (which says that relativistic formulas 
must reduce to the familiar nonrelativistic ones, in the nonrelativistic limit) to find 
Eo in terms of m. We just found that the difference between the energies of a moving 
mass and a stationary mass is jEo — Eo- This must reduce to the familiar kinetic 
energy, mv 2 / 2, in the limit »<c. In other words, 


y/l - v 2 /c 2 



( 11 . 86 ) 


where we have used the Taylor series, 1 /v/1 — e « 1 + e/2. Therefore Eq = me 2 , and 
hence E = jmc 2 . 

We can derive the momentum formula, p = jmv, in a similar way. Let the magnitude 
of the photons’ (equal and opposite) momenta in the particle’s rest frame be po/2. 19 
Using the Doppler-shifted frequencies as above, we see that the total momentum of 
the photons in the frame where the mass moves at speed v is 


P = 




IPov. 


(11.87) 


Putting the c’s back in, we have p = jpqv/c. By conservation of momentum, this is 
the momentum of the mass to moving at speed v. 

We can now use the correspondence principle to find po in terms of to. If p = "f(po/c)v 
is to reduce to the familiar p = mv result in the limit v <C c, then we must have 
Po = me. Therefore, p = 7 mv. 

2. Colliding photons 

The 4-momenta of the photons are (see Fig. 11.29) 


19 With the given information that a photon has E = hv and p = hv/c, we can use the preceding 
Eo = me 2 result to quickly conclude that po = me. But let’s pretend that we haven’t found Eo yet. 
This will give us an excuse to use the correspondence principle again. 
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P 71 =(E,E, 0,0), and P 72 = (E,E cos #, Esin 6 », 0). (11.88) 

Energy and momentum are conserved, so the 4-momentum of the final particle is 
P M = (2E, E + E cos 9, Esin 0, 0). Hence, 

M 2 = P M - Pm = (2 E) 2 - (E + EcosO) 2 - (EsinO) 2 . (11.89) 

Therefore, the desired mass is 


M = £V2( 1-cos0) . (11.90) 

If 0 = 180° then M = 2 E, as it should (none of the final energy is kinetic). And if 0 = 0° 
then M = 0, as it should (all of the final energy is kinetic; we simply have a photon with 
twice the energy). 

3. Increase in mass 

In the lab frame, the energy of the resulting object is 7 M + to, and the momentum 
is still 7 MV. The mass of the object is therefore 

M' = sJ(p(M + m ) 2 - {'iMV) 2 = y/M 2 + 27 Mm + m 2 . (11.91) 

The m 2 term is negligible compared to the other two terms, so we may approximate 
M' as 

M, ~ M /l + 2 F~ M (l + ¥) =M + 7 to, (11.92) 

where we have used the Taylor series, sf\ — i « 1 + e/2. Therefore, the increase in 
mass is 7 times the mass of the stationary object. (This increase must be greater than 
the nonrelativistic answer of “to” , because heat is generated during the collision, and 
this heat shows up as mass in the final object.) 

Remark: The 7 m result is quite clear if we work in the frame where M is initially at rest. 
In this frame, the mass m comes flying in with energy 7 m, and essentially all of this energy 
shows up as mass in the final object. That is, essentially none of it shows up as overall 
kinetic energy of the object. 

This is a general result. Stationary large objects pick up negligible kinetic energy when hit 
by small objects. This is true because the speed of the large object is proportional to m/M, 
by momentum conservation (there’s a factor of 7 if things are relativistic), so the kinetic 
energy goes like Mv 2 oc M(m/M) 2 w 0, if M > m. In other words, the smallness of v wins 
out over the largeness of M. When a snowball hits a tree, all of the initial energy goes into 
heat to melt the snowball; (essentially) none of it goes into changing the kinetic energy of 
the earth. Jit 

4. Compton scattering 

The 4-momenta before the collision are (see Fig. 11.30) 

P m = (me 2 , 0,0,0). (11.93) 


P' m = (we won't need this). (11.94) 


Mt-t-°-°) 


The 4-momenta after the collision are 
7 sin#, 0 ^ , 


(he he 

p - = U’y c ' 


X 



Figure 11.30 
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If we wanted to, we could write P' m in terms of its momentum and scattering angle. 
But the nice thing about the following method is that we don’t need to introduce 
these quantities which we’re not interested in. 

Conservation of energy and momentum give P 7 + P rn = P( + P ' m . Therefore, 


(P 7 + P m - P() 2 

=mP 2 + P^ + P' 2 + 2P m (P 7 - Pfy - 2P 1 P( 

o a _ r. 9 (he hc\ he he. 

>0 + m 2 c 4 + 0 + 2 me 2 2—— (1 - cos6>) 


P' 2 

P' 2 

m 2 c 4 . (11.95) 


Multiplying through by XX' / ( hmc 3 ) gives the desired result, 

A' = A+ — (l-cos6>). (11.96) 

me 

The nice thing about this solution is that all the unknown garbage in P' m disappeared 
when we squared it. 


If 0 « 0 (that is, not much scattering), then A' « A, as expected. 

If 9 = 7r (that is, backward scattering) and additionally A <C h/mc (that is, me 2 <C he/X = 
P 7 , so the photon’s energy is much larger than the electron’s rest energy), then A' — 2 h/mc, 




(11.97) 


Therefore, the photon bounces back with an essentially fixed E(, independent of the initial 
E 1 (as long as E 1 is large enough). This isn’t all that obvious. 


5. Bouncing backwards 

The 4-momenta before the collision are 


P M = (P,p, 0 , 0 ), P m = (rn, 0 , 0 , 0 ), (11.98) 

where p = \JE 1 — M' 2 . The 4-momenta after the collision are 

P' M = (E',p r , 0,0), P^ = (we won't need this), (11.99) 

where p' = \JE n — M 2 . If we wanted to, we could write P' m in terms of its momentum. 
But we don’t need to introduce it. Conservation of energy and momentum give 
Pm + P m = P'm + Pin- Therefore, 

{P M + P m -P' M f = P^ 

P 2 M +Pl + P'*+ 2 P m {P M - P'm) ~ 2P M P' M = P'm (H-100) 

=> M 2 + to 2 + M 2 + 2 m{E - E') - 2 {EE' - pp') = m 2 
=> M 2 — EE' + m{E — E') = pp' 

pm [{M 2 - EE') + m{E - E')Y = (V E 2 - M 2 \JE' 2 - M 2 ^ 

=m M 2 {E 2 - 2 EE' + E' 2 ) + 2(M 2 - EE')m{E - E') + m 2 {E - E') 2 = 0. 

As claimed, E' = E is a root of this equation. This is true because E' = E and p' = p 
certainly satisfy conservation of energy and momentum with the initial conditions, by 
definition. Dividing through by {E — E') gives 

M 2 {E - E') + 2 m{M 2 - EE') + m 2 {E - E') = 0. 


( 11 . 101 ) 
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Solving for E' gives the desired result, 

_ 2toM 2 + E(m 2 + M 2 ) 
“ m 2 + M 2 + 2 Em 

We can double-check a few limits: 


( 11 . 102 ) 


(a) E m M (barely moving): then E' « M, because M is still barely moving. 

(b) m^> E (brick wall): then E' « E, because the heavy mass m picks up essentially no 


(c) M S> m\ then E' « E, because it’s essentially like m is not there. Actually, this only 
holds if E isn’t too big; more precisely, we need Em <C M 2 . 

(d) M = m\ then E' = M, because M stops and m picks up all the energy that M had. 

(e) E m M\ then E' « m/2. This isn’t obvious, but it’s similar to an analogous 
limit in the Compton scattering in Problem 4. 

6. Two-body decay 

B and C have equal and opposite momenta. Therefore, 

~ Mg = p 2 = Eq — Mq. (11.103) 

Also, conservation of energy gives 

E b + E c = M a . (11.104) 

Solving the two previous equations for E B and Ec gives (using the shorthand a = M A , 
etc.) 

a 2 + b 2 - c 2 a 2 + c 2 - b 2 

E b = ^ -, and E c = -. (11.105) 

Eq. (11.103) then gives the momentum of the particles as 


P 


Va 4 + + c 4 - 2 a 2 b 2 - 2 a 2 <? - ZtPc 2 

2a 


(11.106) 


Remark: It turns out that the quantity under the radical may be factored into 

(a + b + c)(a + b - c)(a - b + c){a -b-c). (11.107) 

This makes it clear that if a = ,&# c, then p = 0, because there is no leftover energy for the 
particles to be able to move. & 

7. Threshold energy 

The initial 4-momenta are 

(E,p, 0,0), and (m, 0,0,0), (11.108) 

where p = yjE 2 — to 2 . Therefore, the final 4-momentum is ( E + to, p, 0,0). The 
quantity (E + to) 2 — p 2 is an invariant, and it equals the square of the energy in 
the CM frame. At threshold, there is no relative motion among the final N particles 
(because there is no leftover energy for such motion; see the remark below). So the 
energy in the CM frame is simply the sum of the rest energies, or Nm. We therefore 
have 

(E + to ) 2 — (E 2 — to 2 ) = (Nm) 2 => E = (^ —1^ to. (11.109) 
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Note that E oc N 2 , for large N. 


Remark: Let’s justify rigorously that the final particles should travel as a blob (that is, 
with no relative motion). Using the invariance of E 2 — p 2 , and the fact that p CM = 0, we 

m 2 -(^ 2 = r ) 2 -( h ° M ) 2 

=> (E + m ) 2 -( VE 2 - m^) 2 = (E { CM ) 2 - 0 

=# 2Em + 2rn = (U f CM ) 2 . (11.110) 

Therefore, minimizing E is equivalent to minimizing Ep M . But Ep M is clearly minimized 
when all the final particles are at rest in the CM frame (so there is no kinetic energy added 
to the rest energy). The minimum E is therefore achieved when there is no relative motion 
among the final particles in the CM frame, and hence in any other frame. A 

8. Relativistic harmonic oscillator 

F = dp/dt gives —muj 2 x = d(m , yv)/dt. Using eq. (11.44), we have 


- wx = 7 Tf 


(ii.ni) 


We must somehow solve this differential equation. A helpful thing to do is to multiply 
both sides by v to obtain —u) 2 xx = The right-hand side of this is simply d'y/dt, 

as you can check. Integration then gives —u 2 x 2 / 2 + C = 7, where C is a constant of 
integration. We know that 7 = 1 when x = b, so we find 


= 1 - 


2c 2 




where we have put the c’s back in to make the units right. 
The period is given by 

T = 4 f b — . 

Jo v 

But 7 = 1/a/1 — v 2 /c 2 , and so v = T/7. Therefore, 

T = - / 7 dx. 

cJo vV - 1 


( 11 . 112 ) 


(11.113) 


(11.114) 


Remark: In the limit utb <§; c (so that 7 « 1, from eq. (11.112); that is, the speed is 
always small), we must recover the Newtonian limit. Indeed, to lowest nontrivial order, 
7 2 « 1 + ( w 2 /c 2 )(6 2 - x 2 ), and so 


•tf; 


(11.115) 


This is the correct result, because conservation of energy gives v 2 = u> 2 (b 2 — x 2 ) for a 
nonrelativistic spring. £ 


9. System of masses 

Let the speed of the stick go from 0 to e, where £<v. Then the final speeds of the two 
masses are obtained by relativistically adding and subtracting e from v. (Assume that 
the time involved is small, so that the masses are still essentially moving horizontally.) 
Repeating the derivation leading to eq. (11.17), we see that the final momenta of the 
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two masses have magnitudes ’~{ v ''/ f _(v ± e)m. But since e is small, we may set ss 1, 
to first order. 

Therefore, the forward-moving mass now has momentum 7 V (v+e)m, and the backward- 
moving mass now has momentum — 7 „(n — e)m. The net increase in momentum is 
thus (with 7 v = 7 ) A p = 27 me. Hence, 

Ar) e 

F = — = 27 to— = 27 ma = Ma. (11.116) 


10. Relativistic rocket 

The relation between to and v obtained in eq. (11.60) is independent of the rate at 
which mass is converted to photons. We now assume a certain rate, in order to obtain 
a relation between v and t. 

In the frame of the rocket, we have dm/dr = —a. From the usual time dilation effect, 
we then have dm/dt = — < 7/7 in the ground frame, because the ground frame sees the 
rocket’s clocks run slow (that is, dt = 7 dr). 

Differentiating eq. (11.60), we have 


Using dm = —(a/j)dt, this becomes 


l M dt l 


( 1 + n)(!- 


(11.117) 


(11.118) 


We could simply use a computer to do this dv integral, but let’s do it from scratch. 
Using a few partial-fraction tricks, we have 


/ 


-h 


(1 + U)(1-U2) J (1 + U)(1-U)(1 + U) 

1 f ( 1 1 \ dv 

2 ] (rr^ + r^j r+^ 

If dv 1 f ( 1 1 . \, 

- 2] (TT^ + i J (iT» + TV ») d 
1 , + 


2(1 + v) 4 


(11.119) 


Equation (11.118) therefore gives 


at 

M 


2(1 4 


( 11 . 120 ) 


Remarks: If v <C 1 (or rather, if v <C c), then we may Taylor-expand the quantities in 
eq. (11.120) to obtain at/M « v. This may be rewritten as <r rj M(v/t) = Ma. But a is 
simply the force acting on the rocket (or rather ac, to make the units correct), because this 
is the change in momentum of the photons. We therefore obtain the expected nonrelativistic 
F — ma equation. 

If vrm 1 — e, where e is very small (that is, if v is very close to c), then we can make 
approximations in eq. (11.120) to obtain e « 2e/e 4<Tt ^ M . We see that the difference between 
v and 1 decreases exponentially with I. $ 
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11. Relativistic dustpan I 

This problem is essentially the same as Problem 3. 

Let M be the mass of the dustpan-plus-dust-inside system (which we will label “5”) 
when its speed is v. After a small time dt in the lab frame, S has moved a distance 
v dt, so it has basically collided with an infinitesimal mass Xv dt. Its energy therefore 
increases to 7 M + A vdt. Its momentum is still 7 Mv, so its mass is now 

M' = V(7 M + Xv dt) 2 - ( 7 M ®) 2 « \/M 2 + 2 1 M\v dt, (11.121) 

where we have dropped the second-order dt 2 terms. Using the Taylor series \/l + e w 
1 + e/2, we may approximate M' as 

M'*M ] /AAL^ aM (l + ^yM + 1 \vdt. (11.122) 

The rate of increase in 5”s mass is therefore 7 An. (This increase must certainly be 
greater than the nonrelativistic answer of “An”, because heat is generated during the 
collision, and this heat shows up as mass in the final object.) 


Remarks: This result is quite clear if we work in the frame where S is at rest. In this frame, 
a mass Xv dt comes flying in with energy jXv dt, and essentially all of this energy shows up 
as mass (heat) in the final object. That is, essentially none of it shows up as overall kinetic 
energy of the object, which is a general result when a small object hits a stationary large 
object. 

Note that the rate at which the mass increases, as measured in S ’s frame, is j 2 Xv, due to 
time dilation. (The dust-entering-dustpan events happen at the same location in the dustpan 
frame, so we have indeed put the extra 7 factor in the correct place.) Alternatively, you can 
view things in terms of length contraction. S sees the dust contracted, so its density is 
increased to 7A. 

12. Relativistic dustpan II 

The initial momentum is 7 v MV = P. There are no external forces, so the momentum 
of the dustpan-plus-dust-inside system (denoted by “S”) always equals P. That is, 
7 mv = P, where m and v are the mass and speed of S at later times. 

Let’s find v(x) first. The energy of S, namely 7 to, increases due to the acquisition of 
new dust. Therefore, d('ym) = Xdx, which we can write as 


d 


= Xdx. 


(11.123) 


Integrating this, and using the fact that the initial speed is V, gives P/v — P/V = Xx. 
Therefore, 

'•(,r) . (11-124) 

1+ P 

Note that for large x, this approaches P/(Xx). This makes sense, because the mass 
of S is essentially equal to Xx, and it is moving at a slow, nonrelativistic speed. 

To find v(t), write the dx in eq. (11.123) as v dt to obtain (—P/v 2 ) dv = Xv dt. Hence, 



(11.125) 
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At this point, there are various ways to find x(t). The simplest one is to just integrate 
eq. (11.125). The result is 


z(t) 


(11.126) 


You can show that this reduces to x = Vt for small t, as it should. For large t, x has 
the interesting property of being proportional to \ft. 

13. Relativistic dustpan III 

Let S denote the dustpan-plus-dust-inside system at a given time, and consider a 
small bit of dust (call this subsystem s) that enters the dustpan. In S’ s frame, the 
density of the dust is 7 A, due to length contraction. Therefore, in a time dr (where r 
is the time in the dustpan frame), a little s system of dust with mass 7 A vdr crashes 
into S and loses its (negative) momentum of — U/Av dr) (yv) = —y 2 v 2 A dr. The force 
on s is therefore +-/ 2 v 2 X. The desired force on S is equal and opposite to this, so 

F = - 7 VA. (11.127) 


Now consider the lab frame. In a time dt (where t is the time in the lab frame), a 
little s system of dust with mass Av dt gets picked up by the dustpan. What is the 
change in momentum of s? It is tempting to say that it is (Avdf)( 7 v), but this would 
lead to a force of — 7 V 2 A on the dustpan, which doesn’t agree with the result we found 
above in the dustpan frame. This would be a problem, because longitudinal forces 
should be the same in different frames. 

The key point to realize is that the mass of whatever is moving increases at a rate 
7 At;, and not Av (see Problem 11). We therefore see that the change in momentum of 
the additional moving mass is ('yAv dt)(^v) = j 2 v 2 Adt,. The original moving system 
S therefore loses this much momentum, and so the force on it is F = —j 2 v 2 A, in 
agreement with the result in the dustpan frame. 

14. Relativistic cart I 

Ground frame (your frame): Using reasoning similar to that in Problem 3 or 
Problem 11, we see that the mass of the cart-plus-sand-inside system increases at a 
rate 7 <r. Therefore, its momentum increases at a rate 

= 7(7 a)v = 7 2 av. (11.128) 

dt 

This is the force you exert on the cart, so it is also the force the ground exerts on 
your feet (because the net force on you is zero). 

Cart frame: The sand-entering-cart events happen at the same location in the 
ground frame, so time dilation says that the sand enters the cart at a slower rate in 
the cart frame; that is, at a rate < 7 / 7 . The sand flies in at speed v, and then eventually 
comes at rest on the cart, so its momentum decreases at a rate 7 (^/ 7 )^ = av. This 
is the force your hand applies to the cart. 

If this were the only change in momentum in the problem, then we would be have a 
problem, because the force on your feet would be av in the cart frame, whereas we 
found above that it is ^ 2 av in the ground frame. This would contradict the fact that 
longitudinal forces are the same in different frames. What is the resolution to this 
apparent paradox? 
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The resolution is that while you are pushing on the cart, your mass is decreasing. You 
are moving with speed v in the cart frame, and mass is continually being transferred 
from you (who are moving) to the cart (which is at rest). This is the missing change 
in momentum we need. Let’s be quantitative about this. 

Go back to the ground frame for a moment. We found above that the mass the 
cart-plus-sand-inside system (call this “C”) increases at rate 7a in the ground frame. 
Therefore, the energy of C increases at a rate 7(7cr) in the ground frame. The sand 
provides a of this energy, so you must provide the remaining (j 2 — l)o part. Therefore, 
since you are losing energy at this rate, you must also be losing mass at this rate in 
the ground frame (because you are at rest there). 

Now go back to the cart frame. Due to time dilation, you lose mass at a rate of only 
(7 2 — 1)(t /'y. This mass goes from moving at speed v (that is, along with you), to 
speed zero (that is, at rest on the cart). Therefore, the rate of decrease in momentum 
of this mass is 7((7 2 — l)cr/7)u = (7 s — 1 )<tu. 

Adding this result to the av result due to the sand, we see that the total rate of 
decrease in momentum is 7 2 av. This is therefore the force that the ground applies to 
your feet, in agreement with the calculation in the ground frame. 

15. Relativistic cart II 

Ground frame: Using reasoning similar to that in Problem 3 or Problem 11, we see 
that the mass of the cart-plus-sand-inside system increases at a rate 7 a. Therefore, 
its momentum increases at a rate 7(7 a)v = 7 2 av. 

However, this is not the force that your hand exerts on the cart. The reason is that 
the sand enters the cart at locations that are receding from your hand, so your hand 
cannot immediately be aware of the additional need for momentum. No matter how 
rigid the cart is, it can’t transmit information faster than c. In a sense, there is a sort 
of Doppler effect going on, and your hand only needs to be responsible for a certain 
fraction of the momentum increase. Let’s be quantitative about this. 

Consider two grains of sand that enter the cart a time t apart. What is the difference 
between the two times that your hand becomes aware that the grains have entered 
the cart? Assuming maximal rigidity (that is, assuming that signals propagate along 
the cart at speed c), then the relative speed (as measured by someone on the ground) 
of the signals and your hand is c — v. The distance between the two signals is ct. 
Therefore, they arrive at your hand separated by a time of ct/(c — v). In other words, 
the rate at which you feel sand entering the cart is (c — v )/c times the given cr rate. 
This is the factor by which we must multiply the naive 7 2 av result for the force we 
found above. The force you must apply is therefore 

M 1 -;h v ” = iT7 (U129) 

Cart frame (your frame): The sand-entering-cart events happen at the same 
location in the ground frame, so time dilation says that the sand enters the cart at 
a slower rate in the cart frame; that is, at a rate cr/7. The sand flies in at speed v, 
and then eventually comes to rest on the cart, so its momentum decreases at a rate 
7 (<t/ 7 )u = av. 

But again, this is not the force that your hand exerts on the cart. As before, the sand 
enters the cart at a location far from your hand, so your hand cannot immediately be 
aware of the additional need for momentum. Let’s be quantitative about this. 
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Consider two grains of sand that enter the cart a time t apart. What is the difference 
between the two times that your hand becomes aware that the grains have entered 
the cart? Assuming maximal rigidity (that is, assuming that signals propagate along 
the cart at speed c), then the relative speed (as measured by someone on the cart) of 
the signals and your hand is c (because you are at rest). The distance between the 
two signals is ct + vt, because the sand source is moving away from you at speed v. 
Therefore, the signals arrive at your hand separated by a time of (c + v)t/c. In other 
words, the rate at which you feel sand entering the cart is c/(c + v) times the <7/7 
rate found above. This is the factor by which we must multiply the naive av result 
for the force we found above. The force you must apply is therefore 



in agreement with eq. (11.129). 

In a nutshell, the two naive results in the two frames, 7 2 crv and av, differ by two 
factors of 7. The ratio of the two “Doppler-effect” factors (which arose from the 
impossibility of absolute rigidity) precisely remedies this discrepancy. 

16. Different frames 

(a) The energy of the resulting blob is 2m + Tl. Since the blob is at rest, we have 

M = 2m + Tt. (11.131) 

(b) Let the new frame be S. Let the original frame be S'. The critical point to 
realize is that in frame S the left mass begins to accelerate before the right mass 
does. This is due to the loss of simultaneity between the frames. Note that the 
longitudinal force is the same in the two frames, so the masses still feel a tension 
T in frame S. 

Consider the two events when the two masses start to move. Let the left mass 
and right mass start moving at positions Xi and x r in S. The Lorentz transfor¬ 
mation Ax = j(Ax' + vAt') tells us that x r — x/ = 7 1, because Ax' = t and 
At' = 0 for these events. 

Let the masses collide at position x c in S. Then the gain in energy of the left 
mass is T(x c — X/), and the gain in energy of the right mass is (—T)( x c — x r ) 
(which is negative if x c > x r ). The gain in the sum of the energies is therefore 

A E = T(x c - *,) + {-T){x c - x r ) = T(x r - %) = Tjl (11.132) 

The initial sum of energies was 27 m, so the final energy is 

E = 27 m + 7 T£ = 7 M, (11.133) 


as desired. 

17. Splitting mass 

We’ll calculate the times for the two parts of the process to occur. 

The energy of the mass fight before it s plits is Eb = M + T(£/ 2), so the momentum 
is pb = ^ El — M 2 = \J MT£ TT' 2 £ 2 /I- Using F = dp/dt, the time for the first part 
of the process is 


h = -y/Mfe + rW/i. 


(11.134) 
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The momentum of the front half of the mass immediately after it splits is p a = p b /2 = 
(l/2}^MK + P#/4. The energy at the wall is E w = E b /2+T(£/2) = M/2+37T/4, 
so the momentum at the wall is p w = i /Eft, — (M/2) 2 = (1 /2) AMTl + TT’ 2 £ 2 j'A. 
The change in momentum during the second part of the process is therefore A p = 
p w -p a = (1 /2Jtv"3 MTt +■ (1 /2) y/MTl The time for the second 

part is thus 

t 2 = ^3MT£ + 9T 2 £ 2 /4 - yjMTl + T 2 £ 2 /4^j . (11.135) 

The total time is ti+t 2 , which simply changes the minus sign in the above expression 
to a plus sign. 

18. Relativistic leaky bucket 


(a) Let the wall be at x = 0, and let the initial position be at x = £. Consider a small 
interval during which the bucket moves from a: to x + dx (where dx is negative). 
The bucket’s energy changes by (—T) dx due to the string, and it also changes 
by a fraction dx/x, due to the leaking. Therefore, dE = (-T) dx + E dx/x, or 


dE 

dx 


(11.136) 


In solving this differential equation, it is convenient to introduce the variable 
y = E/x. Then E' = xy' + y, where a prime denotes differentiation with respect 
to x. Eq. (11.136) then becomes xy' = —T, or 


dy 


—Tdx 


(11.137) 


Integration gives y = —Tlnaj + C, which we may write as y = —Tln(x/£) + B, in 
order to have a dimensionless argument in the log. Since E = xy, we therefore 
have 

E = Bx — Tx\n(x/£). (11.138) 

The reasoning up to this point is valid for both the total energy and the kinetic 
energy. Let’s now look at each of these cases. 


• Total energy: Eq. (11.138) gives 

E = M(xjT) - Tx \n(x/l), (11.139) 

where the constant of integration, B, has been chosen so that E = M when 
x = £. To find the maximum of E, it is more convenient to work with the 
fraction z = x/£, in terms of which E = Mz — T£z\nz. Setting dE/dz 
equal to zero gives 

\nz =— n =* Emax = —e M ' Ti . (11.140) 

T£ e 

The fraction 2 must satisfy z < 1, so we must have In z < 0. Therefore, a 
solution for 2 exists only for M < T£. If M >T£, then the energy decreases 
all the way to the wall. 

If M is slightly less then T£, then « is slightly less than 1, so £ quickly achieves a 
maximum of slightly more than M, then decreases for the rest of the way to the 

If M < Tf, then E achieves its maximum at 2 « 1/e, where it has the value 
Tl/e. 
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• Kinetic energy: Eq. (11.138) gives 

KE = —Txln(x/l), (11.141) 


where the constant of integration, B, has been chosen so that KE = 0 
when x = l. Equivalently, E — KE must equal the mass M(xJT). In terms 
of the fraction z = x/l, we have KE = —Tlzlnz. Setting d(KE)/dz equal 
to zero gives 



e 


KE max 


Tl 
e ’ 


(11.142) 


which is independent of M. Since this result is independent of M, it must 
hold in the nonrelativistic limit. And indeed, the analogous “Leaky-bucket” 
problem in Chapter 4 (Problem 4.16) gave the same result. 


(b) Eq. (11.139) gives, with z = x/I, 


p = \/E 2 - ( Mz) 2 = ^(Mz-Tlz\nz) 2 -(Mz) 2 

= V-ZMTIz 2 In z + T 2 l 2 z 2 In 2 *. (11.143) 

Setting the derivative equal to zero gives Tl In 2 2 + (Tl — 2 M) Inz — M = 0. 
The maximum momentum therefore occurs at 


2 M-Tl- \/T 2 l 2 + AM 2 


(11.144) 


We have ignored the other root, because it gives Inz > 0. 


If M -C Tl, then the maximum p occurs atz« 1/e. In this case, the bucket immedi¬ 
ately becomes relativistic, so we have E ~ pc. Therefore, both E and p should achieve 
their maxima at the same place. This agrees with the result for E above. 

If M ^$> Tl, then the maximum p occurs at z « l/y^e. In this case, the bucket is 
nonrelativistic, so the result should agree with the analogous “Leaky-bucket” problem 
in Chapter 4 (Problem 4.16), which it does. 

19. Relativistic bucket 


(a) The mass’s energy just before it hits the wall is E = m + Tl. Therefore, the 
momentum just before it hits the wall is p => \f E? — in 2 = \/2 mTl + T 2 l 2 . 
F = Ap/At then gives (using the fact that the tension is constant) 

f = (l 1 . 1 45) 

If m Tl, then At « l (or l/c in normal units), which is correct, because the mass 
essentially travels at speed c. 

If m> Tl, then At « ml/T. This is the nonrelativistic limit, and it agrees with 
the result obtained from the familiar l = at 2 /2, where a = T/m is the acceleration. 

(b) Straightforward method: The energy of the blob right before it hits the wall 
is E w = 2m + 2 Tl. If we can find the mass, M, of the blob, then we can use 
p = \JE 2 — M 2 to get the momentum, and then use At = Ap/F to get the 
time. 20 

20 Note that although the tension T acts on two different things (the mass m initially, and then 
the blob), it is valid to use the total Ap to obtain the total time At via At = A p/F, simply because 
we could break up the A p into its two parts, and then find the two partial times, and then add 
them back together to get the total At. 
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The momentum right before the collision is Pb = s/lmTi + T 2 ! 2 , and this is 
also the momentum of the blob right after the collision, p a . 

The energy of the blob right after the collision is E a = 2 m + TI. So the mass of 
the blob after the collision is M = y/E^—p 2 = \/4m 2 + 2 mTl. 

Therefore, the momentum at the wall is p w = \/E 2 } — M 2 = \/6rnT£ + IT 2 ? 2 , 
and hence _ 


A p V6mT£ + IT 2 ! 2 
~F ~ T 


(11.146) 


Note that if to = 0 then At = 2£, as it should. 


Better method: In the notation in the footnote in the statement of the problem, 
the change in p 2 from the start to just before the collision is A (p 2 ) = E\ — E 2 . 
This is true because 


E 2 — m 2 = p 2 , and E| — m 2 = p 2 , (11.147) 

and since m is the same throughout the first half of the process, we have A (E 2 ) = 

A (p 2 ). 

Likewise, the change in p 2 during the second half of the process is A (p 2 ) = 
E 2 — E 2 , because 

.Ef — M 2 = p\, and E 2 — M 2 = p 2 , (11.148) 

and since M is the same throughout the second half of the process, 21 we have 
A(£ 2 ) = A(p 2 ). 

The total change in p 2 is the sum of the above two changes, so the final p 2 is 
p 2 = (E 2 -E 2 ) + (E 2 -E 2 ) 

= ((to + TT) 2 - TO 2 ) + ((2to + 2 Tl) 2 - {2m + T £) 2 ) 

= 6mT£ + 4T 2 e 2 , (11.149) 


as in eq. (11.146). The first solution above basically performs the same calcu¬ 
lation, but in a more obscure manner. 

(c) The reasoning in part (b) tells us that the final p 2 equals the sum of the A(E 2 ) 
terms over the N parts of the process. So we have, using an indexing notation 
analogous to that in part (b), 


f> ! = E(4-4-i) 

k =1 

= ^2 (^{km + kT£) 2 - (km + (k - l)T£) 2 ^) 
= ( 2kmT( - + ( fc2 - (k - i) 2 )r 2 ^ 2 ) 

= N(N + l)mT£ + N 2 T 2 £ 2 . 

Therefore, 

A A P ojNjNAr 

~ ~Y _ T ' 

This checks with the results from parts (a) and (b). 


(11.150) 

(11.151) 


21 M happens to be \/4m 2 + 2mT£. but the nice thing about this solution is that we don’t need 
know this. All we need to know is that it is constant. 
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(d) We want to take the limit N —* oo, £ —> 0, m —> 0, with the restrictions that 
Nl = L and Nrn = M. Written in terms of M and L, the result in part (c) is 

±t _ \/(l l/N)MTL — T 2 L 2 _ VMTL + T 2 L 2 , (|U _ 2) 

as N —> oo. Note that this time equals the time is takes for one particle of mass 
m = M/2 to reach the wall, from part (a). 

The mass, Mf, of the final blob at the wall is 

,\I f = ^El - p 2 w = y/(M + TL ) 2 - (MTL + T 2 L 2 ) 

= y/M 2 + MTL. (11.153) 

If TL < C M, then Mf « M, which makes sense. If M TL, then Mf rts 
y/MTL, so Mf is the geometric mean between the given mass and the energy 
stored in the string, which isn’t entirely obvious. 
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Chapter 12 


4-vectors 

Copyright 2004 by David Morin, morin@physics.harvard.edu 

We now come to a very powerful concept in relativity, namely that of 4-vectors. 
Although it is possible to derive everything in special relativity without the use of 
4-vectors (and indeed, this is the route, give or take, that we took in the previous 
two chapters), they are extremely helpful in making calculations and concepts much 
simpler and more transparent. 

I have chosen to postpone the introduction to 4-vectors until now, in order to 
make it clear that everything in special relativity can be derived without them. 
In encountering relativity for the first time, it’s nice to know that no “advanced” 
techniques are required. But now that you’ve seen everything once, let’s go back 
and derive various things in an easier way. 

This situation, where 4-vectors are helpful but not necessary, is more pronounced 
in general relativity, where the concept of tensors (the generalization of 4-vectors) is, 
for all practical purposes, completely necessary for an understanding of the subject. 
We won’t have time to go very deeply into GR in Chapter 13, so you’ll have to just 
accept this fact. But suffice it to say that an eventual understanding of GR requires 
a firm understanding of special-relativity 4-vectors. 


12.1 Definition of 4-vectors 

Definition 12.1 The 4-tuplet, A = (Ao, Ai, A 2 , A3), is a “4-vector” if the Aj trans¬ 
form under a Lorentz transformation in the same way that ( cdt,dx,dy,dz ) do. In 
other words, they must transform like (assuming the LT is along the x-direction; see 
Fig. 12.1): 

A 0 = 7(A' 0 + (u/c)Ai), 

At = T{A! l + {vlc)A! 0 ), 

4'2 = A' 2 , 

A 3 = A3. (12.1) 

Remarks: 

1. Similar equations must hold, of course, for Lorentz transformations in the y- and 
^-directions. 



Figure 12.1 
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2. Additionally, the last three components must be a vector in 3-space. That is, they 
must transform like a usual vector under rotations in 3-space. 

3. We’ll use a capital Roman letter to denote a 4-vector. A bold-face letter will denote, 
as usual, a vector in 3-space. 

4. Lest we get tired of writing the c’s over and over, we will henceforth work in units 
where c = 1. 

5. The first component of a 4-vector is called the “time” component. The other three 
are the “space” components. 

6 . The components in ( dt,dx,dy,dz ) are sometimes referred to as (dx 0 , dx-\. d/x- 2 - dx 3 ). 
Also, some treatments use the indices “1” through “4”, with “4” being the “time” 
component. But we’ll use “0” through “3”. 

7. The Ai may be functions of the dx ,, the Xi and their derivatives, any invariants (that 
is, frame-independent quantities) such as the mass rn, and v. 

8 . 4-vectors are the obvious generalization of vectors in regular space. A vector in 3- 
dimensions, after all, is something that transforms under a rotation just like (dx, dy, dz) 
does. We have simply generalized a 3-D rotation to a 4-D Lorentz transformation. $ 


12.2 Examples of 4-vectors 

So far, we have only one 4-vector at our disposal, namely ( dt , dx, dy, dz). What are 
some others? Well, (7 dt, 7dx, 7dy, 7dz) certainly works, as does any other constant 
multiple of ( dt,dx,dy,dz ). Indeed, m(dt,dx,dy,dz) is a 4-vector, because m is an 
invariant (independent of frame). 

How about A = (dt, 2 dx, dy, dz)7 No, this isn’t a 4-vector, because on one hand 
it must transform (assuming it’s a T- vector) like 


dt = Ao 

= 7 (A' 0 + vAi) = 7 (dt' + v(2 dx ')), 

2 dx = Ai 

= 7 (A’ l + vA' 0 ) = 7 ((2 dx') + v dt'^j , 

dy = A 2 

= A' 2 = dy', 

dz = A 3 

= 4(5 = dz', 

from the definition of a 4-vector. But on the other hand, it transforms like 


dt = 7 (dt' + vdx'), 

2dx = 27 (dx' + v dt'), 
dy = dy', 

dz = dz', (12.3) 

because this is how the dxi transform. The two preceding sets of equations are 
inconsistent, so A = (dt,2dx,dy,dz) is not a 4-vector. Note that if we had instead 
considered the 4-tuplet, A = (dt, dx, 2d;y, dz), then the two preceding equations 

would have been consistent. But if we had then looked at how A transforms under 
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a Lorentz transformation in the y-direction, we would have found that it is not a 
4-vector. 

The moral of this story is that the above definition of a 4-vector is a nontrivial one 
because there are two possible ways that a 4-tuplet can transform. It can transform 
according to the 4-vector definition, as in eq. (12.2). Or, it can transform by simply 
having each of the Ai transform separately (knowing how the dx, transform), as in 
eq. (12.3). Only for certain special 4-tuplets do these two methods give the same 
result. By definition, we label these special 4-tuplets as 4-vectors. 

Let us now construct some less trivial examples of 4-vectors. In constructing 
these, we will make abundant use of the fact that the proper-time interval, dr = 
y/dt 2 — dr 2 , is an invariant. 

• Velocity 4-vector: We can divide ( dt,dx,dy,dz ) by dr, where dr is the 
proper time between two events (the same two events that yielded the dt, 
etc.). The result is indeed a 4-vector, because dr is independent of the frame 
in which it is measured. Using dr = dt/ 7, we obtain 

r = ildt,dx,dy,iz) *4" (l, % % i T) = (7.TV) (12.4) 

is a 4-vector. This is known as the velocity 4-vector. In the rest frame of the 
object we have v = 0, so V reduces to V = (1,0,0,0). With the c’s, we have 
V = (jc, 7v). 

• Energy-momentum 4-vector: If we multiply the velocity 4-vector by the 
invariant m, we obtain another 4-vector, 


P = mV = (7m,7mv) = [E, p), (12-5) 

which is known as the energy-momentum 4-vector (or the ^-momentum for 
short), for obvious reasons. In the rest frame of the object, P reduces to 
P = (m, 0,0,0). With the c’s, we have P = (qmc, qmv) = (E/c, p). Some 
treatments multiply through by c, so that the 4-momentum is (E,pc). 


• Acceleration 4-vector: We can also take the derivative of the velocity 4- 
vector with respect to r. The result is indeed a 4-vector, because taking the 
derivative simply entails taking the difference between two 4-vectors (which 
results in a 4-vector because eq. (12.1) is linear), and then dividing by the 
invariant dr (which again results in a 4-vector). Using dr = dt/'y, we obtain 


As 


dV 

dr 


d 

dr 


(7>7 V ) = 7 


(dry 

V dt 


d{ tv) ^ 
dt ) ' 


( 12 . 6 ) 


Using dj/dt = vv/(l — u 2 ) 3 / 2 = ^vv, we have 


A = ( 7 4 vh , 7 4 w)v + 7 2 a), 


(12.7) 


where a = dv/dt. A is known as the acceleration 4-vector. In the rest frame 
of the object (or, rather, in the instantaneous inertial frame), A reduces to 
A = (0,a). 
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As we always do, we will pick the relative velocity, v, to point in the x- 
direction. That is, v = (^,0,0). This means that v = v x . and also that 
v = i) x = a x } Eq. (12.7) then becomes 

A = (7 4 v x a x , 7 4 v 2 a x + 7 2 a x , j 2 a y , 7 2 a z ) 

= (7 A v x a x , 7 4 a x , 7 2 a y , 7 2 a z ). (12-8) 

We can keep taking derivatives with respect to r to create other 4-vectors, but 
these have little relevance in the real world. 


• Force 4-vector: We define the force 4-vector as 


F 


dP _ / dE dp \ _ / dE 

dr ^ V dt ’ dt ) ^ V dt 


(12.9) 


where f = d('-/mv)/dt is the usual 3-force. We’ll use f instead of F in this 
chapter, to avoid confusion with the 4-force, F. 

In the case where m is constant, 2 F can be written a s F = d{mV)/dr = 
mdV/dr = mA. We therefore still have a nice "F equals mi” law of physics, 
but it’s now a 4-vector equation instead of the old 3-vector one. In terms of 
the acceleration 4-vector, we may use eq. (12.7) to write (if m is constant) 

F = mA = (7 4 mro , r y 4 mvv\’ + 7 2 ma). (12.10) 


In the rest frame of the object (or, rather, the instantaneous inertial frame), F 
reduces to F = (0, f), because dE/dt = 0, as you can verify. Also, mA reduces 
to mA = (0,ma). Therefore, F = mA reduces to the familiar f = ma. 


12.3 Properties of 4-vectors 

The appealing thing about 4-vectors is that they have many useful properties. Let’s 
look at some of these. 

• Linear combinations: If A and B are 4-vectors, then C = a A + bB is also 
a 4-vector. This is true because the transformations in eq. (12.1) are linear 
(as we noted above when deriving the acceleration 4-vector). This linearity 
implies that the transformation of, say, the time component is 

Cq = (aA -T 6S)o = oAq -T bB 0 = o(Ag -|- vA r f) -|- b^B^ -|- vB r f) 

= (aA' 0 + bB' 0 ) + ufyAi + bB[) 

= C' 0 + vC[, (12.11) 

which is the proper transformation for the time component of a 4-vector. 
Likewise for the other components. This property holds, of course, just as it 
does for linear combinations of vectors in 3-space. 

1 The acceleration vector, a, is free to point in any direction, but you can check that the 0’s in 
v lead to v = a x . See Exercise 1. 

2 The mass m would not be constant if the object were being heated, or if extra mass were being 
added to it. We won’t concern ourselves with such cases here. 
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• Inner-product invariance: Consider two arbitrary 4-vectors, A and B. 
Define their inner product to be 

A ■ B = AqBq — A\B\ — A 2 B 2 — A 3 B 3 = AqBq — A • B. (12.12) 

Then A ■ B is invariant. That is, it is independent of the frame in which it is 
calculated. This can be shown by direct calculation, using the transformations 
in eq. (12.1): 

A ■ B = AqBq — A\B\ — A 2 B 2 — A 3 B 3 

= (7(^0 + wAl)) (7 (B' 0 + vB[)^j - (7(A; + vA' 0 )^j (y(B[ + vB' 0 )^j 

—A 2 B 2 — A 3 B 3 

= 7 2 (AoBo + v(A , qB , 1 + A^Bq) + v 2 A^B'^j 

+ V {A\ B' {) + AqB^) + v 2 A'qB'^ 

—A 2 B 2 — A3B3 

= A' 0 B' 0 ( 7 2 - 7 2 n 2 ) - A'M'f - 7 V) - A' 2 B' 2 - A' 3 B' 3 
= AqBq — A\ B\ — A i 2 B 2 — A 3 B 3 

= A’-B’. (12.13) 

The importance of this result cannot be overstated. This invariance is anal¬ 
ogous to the invariance of the inner product, A • B, for rotations in 3-space. 
The above inner product is also invariant under rotations in 3-space, because 
it involves the combination A • B. 

The minus signs in the inner product may seem a little strange. But the goal 
was to find a combination of two arbitrary vectors that is invariant under a 
Lorentz transformation (because such combinations are very useful in seeing 
what is going on in a problem). The nature of the LT’s demands that there 
be opposite signs in the inner product, so that’s the way it is. 

• Norm: As a corollary to the invariance of the inner product, we can look at 
the inner product of a 4-vector with itself, which is by definition the square of 
the norm. We see that 

A 2 = A ■ A = AqAq — A\A\ — A 2 A 2 — A 3 A 3 = Aq — | A| 2 (12.14) 

is invariant. This is analogous to the invariance of the norm \/A • A for 
rotations in 3-space. Special cases of the invariance of the 4-vector norm are 
the invariance of c 2 t 2 — x 2 in eq. (10.37), and the invariance of E 2 — p 2 in eq. 
(11.20). 

• A theorem: Here’s a nice little theorem: 


If a certain one of the components of a f-vector is 0 in every frame, then all 
four components are 0 in every frame. 
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Proof: If one of the space components (say, A\) is 0 in every frame, then the 
other space components must also be 0 in every frame, because otherwise a 
rotation would make A\ ^ 0. Also, the time component A$ must be 0 in every 
frame, because otherwise a Lorentz transformation in the ^-direction would 
make A\ ^ 0. 

If the time component, Aq, is 0 in every frame, then the space components 
must also be 0 in every frame, because otherwise a Lorentz transformation in 
the appropriate direction would make Aq ^ 0. ■ 

If someone comes along and says that she has a vector in 3-space that has no 
x’-component, no matter how you rotate the axes, then you would certainly say 
that the vector must obviously be the zero vector. The situation in Lorentzian 
4-space is basically the same, because all the coordinates get intertwined with 
each other in the Lorentz (and rotation) transformations. 


12.4 Energy, momentum 

12.4.1 Norm 

Many useful things arise from the simple fact that the P in eq. (12.5) is a 4-vector. 
The invariance of the norm implies that P ■ P = E 2 — |p| 2 is invariant. If we are 
dealing with only one particle, we can determine the value of P 2 by conveniently 
working in the rest frame of the particle (so that v = 0). We obtain 

E 2 — p 2 = m 2 , (12.15) 

or E 2 —p 2 c 2 = m 2 c 4 , with the c’s. We already knew this, of course, from just writing 
out E 2 -p 2 = 7 2 m 2 - 7 2 m 2 v 2 = to 2 . 

For a collection of particles, knowledge of the norm is very useful. If a process 
involves many particles, then we can say that for any subset of the particles, 

(^2^) — (T, p) is invariant, (12.16) 

because this is simply the norm of the sum of the energy-momentum 4-vectors of 
the chosen particles. The sum is again a 4-vector, due to the linearity of eqs. (12.1). 

What is the value of the invariant in eq. (12.16)? The most concise description 
(which is basically a tautology) is that it is the square of the energy in the CM 
frame, that is, in the frame where = 0. For one particle, this reduces to to 2 . 

Note that the sums are taken before squaring in eq. (12.16). Squaring before 
adding would simply give the sum of the squares of the masses. 

12.4.2 Transformation of E,p 

We already know how the energy and momentum transform (see Section 11.2), but 
let’s derive the transformation again here in a very quick and easy manner. We 
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know that (E,p x ,p y ,p z ) is a 4-vector. So it must transform according to eq. (12.1). 
Therefore (for an LT in the ^-direction), 

E = 7 (E' + vp' x ), 

Px = 7 {p' x + vE ')i 
P y = Py, 

Pz = p' z , (12.17) 

in agreement with eq. (11.18). That’s all there is to it. 

Remark: The fact that E and p are part of the same 4-vector provides an easy way 
to see that if one of them is conserved (in every frame) in a collision, then the other is 
also. Consider an interaction among a set of particles, and look at the 4-vector, A P = 
-Pafter — -^before- If E is conserved in every frame, then the time component of A P is 0 in 
every frame. But then the theorem in the previous section says that all four components of 
A P are 0 in every frame. Therefore, p is conserved. Likewise for the case where one of the 
Pi is known to be conserved. £ 

12.5 Force and acceleration 

Throughout this section, we will deal with objects with constant mass, which we 
will call “particles”. The treatment here can be generalized to cases where the mass 
changes (for example, the object is being heated, or extra mass is being dumped on 
it), but we won’t concern ourselves with these. 

12.5.1 Transformation of forces 

Let’s first look at the force 4-vector in the instantaneous inertial frame of a given 
particle (frame S'). Eq. (12.9) gives 

F/ = 7 (^’ f/ ) =( ° ,f/) ' (12 ’ 18) 

The first component is zero because dE'/dtt = d(mfs/\ — i/ z ')/dt. and this carries a 
factor of v', which is zero in this frame. Equivalently, you can just use eq.(12.10), 
with a speed of zero. 

We can now write down two expressions for the 4-force, F, in another frame, 
S. First, since F is a 4-vector, it transforms according to eq. (12.1). We therefore 
have, using eq. (12.18), 


= 7(7*0 + vF [) = 71//*, 
Fi = + vFfo = 7 f' x , 

Ei = 7*2 = fy, 

Fs = F' = f' z . 


(12.19) 
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fx 


-1* 

Figure 12.2 



— f x = f x 


i s 

Figure 12.3 


But second, from the definition in eq. (12.9), we also have 
F 0 = jdE/dt, 

Ft = ifxi 

F2 = ify, 

F 3 = if z . (12.20) 

Combining eqs. (12.19) and (12.20), we obtain 
dE/dt = vf ' x , 

fx = fx, 

fy = fyfr, 

fz = fh ■ (12.21) 

We therefore recover the results of Section 11.5.3. The longitudinal force is the same 
in both frames, but the transverse forces are larger by a factor of 7 in the particle’s 
frame. Hence, f y /f x decreases by a factor of 7 when going from the particle’s frame 
to the lab frame (see Fig. 12.2 and Fig. 12.3). 

As a bonus, the Tq component in eq. ( 12 . 21 ) tells us (after multiplying through 
by dt ) that dE = f x dx, which is the work-energy result. In other words, using 
f x = dp x /dt, we have just proved again the result, dE/dx = dp/dt , from Section 
11.5.1. 

As noted in Section 11.5.3, we can’t switch the S and S' frames and write 
f = f y /i . When talking about the forces on a particle, there is indeed one preferred 
frame of reference, namely that of the particle. All frames are not equivalent here. 
When forming all of our 4-vectors in Section 12.2, we explicitly used the dr, dt , dx, 
etc., from two events, and it was understood that these two events were located at 
the particle. 


12.5.2 Transformation of accelerations 

The procedure here is similar to the above treatment of the force. Let’s first look 
at the acceleration 4-vector in the instantaneous inertial frame of a given particle 
(frame S'). Eq. (12.7) or eq. (12.8) gives 

A'= (0,a'), (12.22) 

because v' = 0 in S'. 

We can now write down two expressions for the 4-acceleration, A, in another 
frame, S. First, since A is a 4-vector, it transforms according to eq. (12.1). So we 
have, using eq. (12.22), 

A 0 = i(A' 0 + vA\) = iva! x , 

A\ = q(A / 1 + vAq) = ia' x , 

A2 = A2 = Q*y, 

A 3 = Ag = CL Z . 


(12.23) 
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But second, from the expression in eq. (12.8), we also have 
Aq = 7 4 va x , 

Ai = 7 4 a x , 

A 2 = 7 2 a y , 

A$ = 7 2 a z . (12.24) 

Combining eqs. (12.23) and (12.24), we obtain 

a x = a'J 7 3 , 

a x = a^/ 7 3 , 

a y = a 'yh 2 i 

a z = a' 2 / 7 2 . (12.25) 

(The first two equations here are redundant.) We see that a y /a x increases by a 
factor of 7 3 /q 2 = 7 when going from the particle’s frame to the lab frame (see 
Fig. 12.4 and Fig. 12.5). This is the opposite of the effect on f y /f x - 3 This difference 
makes it clear that an f = ma law wouldn’t make any sense. If it were true in one 
frame, then it wouldn’t be true in another. 

Note also that the increase in a y /a x in going to the lab frame is consistent with 
length contraction, as the Bead-on-a-rod example in Section 11.5.3 showed. 





Figure 12.4 





Figure 12.5 


Example (Acceleration for circular motion): A particle moves with constant 
speed v along the circle x 2 + y 2 = r 2 , z = 0 , in the lab frame. At the instant the 
particle crosses the negative y -axis (see Fig. 12.6), find the 3-acceleration and 4- 
acceleration in both the lab frame and the instantaneous rest frame of the particle 
(with axes chosen parallel to the lab’s axes). 

Solution: Let the lab frame be S, and let the particle’s instantaneous inertial frame 
be S' when it crosses the negative y-axis. Then S and S' are related by a Lorentz 
transformation in the x-direction. 

The 3-acceleration in S is simply 

a = ( 0 , v 2 /r, 0 ). (12.26) 

There’s nothing fancy going on here; the nonrelativistic proof of a = v 2 /r works just 
fine again in the relativistic case. Eq. (12.7) or (12.8) then gives the 4-acceleration 
in S as 

A = ( 0 , 0 , 7 2 n 2 /r, 0 ). (12.27) 

To find the acceleration vectors in S', we will use the fact S' and S are related by 
a Lorentz transformation in the x-direction. Therefore, the A 2 component of the 
4-acceleration is unchanged. So the 4-acceleration in S' is also 

A' = A = ( 0 , 0,7 V/r,0). (12-28) 

3 In a nutshell, this difference is due to the fact that 7 changes with time. When talking about 
accelerations, there are 7 ’s that we have to differentiate; see eq. (12.6). This isn’t the case with 

forces, because the 7 is absorbed into the definition ofps 7 mv; see eq. (12.9). This is what leads 

to the different powers of 7 in eq. (12.24), in contrast with the identical powers in eq. (12.20). 


y 



Figure 12.6 
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In the particle’s frame, a' is simply the space part of A (using eq. (12.7) or (12.8), 
with v = 0 and 7 = 1). Therefore, the 3-acceleration in S' is 

a 1 = (0, 7 2 n 2 /r, 0). (12.29) 


Remark: We can also arrive at the two factors of 7 in a' by using a simple time-dilation 
argument. We have 

; = dV = d 2 y' _ 2 d?V _ 2^ 

° y dr 2 d(t/ y ) 2 ^ dt 2 ^ r ’ 

where we have used the fact that transverse lengths are the same in the t 1 


(12.30) 
0 frames, ft 


12.6 The form of physical laws 

One of the postulates of special relativity is that all inertial frames are equivalent. 
Therefore, if a physical law holds in one frame, then it must hold in all frames. 
Otherwise, it would be possible to differentiate between frames. As noted in the 
previous section, the statement “f = ma” cannot be a physical law. The two sides 
of the equation transform differently when going from one frame to another, so the 
statement cannot be true in all frames. 

If a statement has any chance of being true in all frames, it must involve only 
4-vectors. Consider a 4-vector equation (say, “A = B") which is true in frame S. 
Then if we apply to this equation a Lorentz transformation (call it Ad) from S to 
another frame S', we have 


A = B, 

=► MA = MB, 

=> A' = B'. 

(12.31) 

The law is therefore also true in frame S'. 

Of course, there are many Tvector equations that are simply not true (for ex¬ 
ample, F = P, or 2 P = 3 P). Only a small set of such equations (for example, 
F = mA) correspond to the real world. 

Physical laws may also take the form of scalar equations, such as P ■ P = m 2 . 
A scalar is by definition a quantity that is frame-independent (as we have shown 
the inner product to be). So if a scalar statement is true in one inertial frame, 
then it is true in all inertial frames. Physical laws may also be higher-rank “tensor” 
equations, such as arise in electromagnetism and general relativity. We won’t discuss 
such things here, but suffice it to say that tensors may be thought of as things built 
up from 4-vectors. Scalars and 4-vectors are special cases of tensors. 

All of this is exactly analogous to the situation in 3-D space. In Newtonian 
mechanics, f = ma is a possible law, because both sides are 3-vectors. But f = 
m(2a x , a y , a z ) is not a possible law, because the right-hand side is not a 3-vector; it 
depends on which axis you label as the x-axis. Another example is the statement 
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that a given stick has a length of 2 meters. That’s fine, but if you say that the stick 
has an x-component of 1.7 meters, then this cannot be true in all frames. 

God said to his cosmos directors, 

“I’ve added some stringent selectors. 

One is the clause 

That your physical laws 

Shall be written in terms of 4-vectors.” 
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12.7 Exercises 


B 



Figure 12.7 



1. Acceleration at rest 

Show that the derivative of v = <Jv^ ; + + v‘l equals a x . independent of how 

all the Vi’s are changing, provided that v y = v z = 0 at the moment in question. 

2. Linear acceleration * 

A particle’s velocity and acceleration both point in the x-direction. with mag¬ 
nitudes v and v, respectively (as measured in the lab frame). In the spirit 
of the example in Section 12.5.2, find the 3-acceleration and 4-acceleration in 
both the lab frame and the instantaneous rest frame of the particle. Verify 
that 3-accelerations are related according to eq. (12.25). 

3. Same speed * 

Consider the setup in Problem 2. Given v, what should 6 be so that the speed 
of one particle, as viewed by the other, is also v? Do your answers make sense 
for v sfe 0 and v & c? 

4. Three particles ** 

Three particles head off with equal speeds v, at 120° with respect to each 
other, as shown in Fig. 12.7. What is the inner product of any two of the 
4-velocities in any frame? Use your result to find the angle 6 (see Fig. 12.8) 
at which two particles travel in the frame of the third. 

5. Doppler effect * 

Consider a photon traveling in the x-direction. Ignoring the y and z com¬ 
ponents, and setting c = 1, the ^momentum is (p,p). In matrix notation, 
what are the Lorentz transformations for the frames traveling to the left and 
to the right at speed vl What is the new 4-momentum of the photon in these 
new frames? Accepting the fact the a photon’s energy is proportional to its 
frequency, verify that your results are consistent with the Doppler results in 
Section 10.6.1. 


Figure 12.8 
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1. Velocity addition 

In A’s frame, B moves to the right with speed u, and C moves to the left 
with speed v. What is the speed of B with respect to Cl In other words, use 
4-vectors to derive the velocity-addition formula. 

2. Relative speed * 

In the lab frame, two particles move with speed v along the paths shown in 
Fig. 12.9. The angle between the trajectories is 29. What is the speed of one 
particle, as viewed by the other? 

3. Another relative speed * 

In the lab frame, two particles, A and B, move with speeds u and v along the 
paths shown in Fig. 12.10. The angle between the trajectories is 6. What is 
the speed of one particle, as viewed by the other? 

4. Acceleration for linear motion * 

A spaceship starts at rest with respect to frame S and accelerates with con¬ 
stant proper acceleration a. In Section 10.7, we showed that the speed of the 
spaceship with respect to S is given by v(t) = tanh(ar), where r is the space¬ 
ship’s proper time (and c = 1). Let V be the spaceship’s T velocity, and let A 
be its 4-acceleration. In terms of the proper time r, 

(a) Find V and A in frame S, by explicitly using v(t) = tanh(ar). 

(b) Write down V and A in the spaceship’s frame, S'. 

(c) Verify that V and A transform like 4-vectors between the two frames. 



Figure 12.9 



• B 

Figure 12.10 
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A 's frame 

12.9 Solutions 


w C 

-► • 

C's frame 

Figure 12.11 


1. Velocity addition 

Let the desired speed of B with respect to C be w. See Fig. 12.11. 

In A’s frame, the 4-velocity of B is (p/ u , 7 u tt), and the 4-velocity of C is pf v . — 7 „u). 
We have suppressed the y and z components here. 

In C’s frame, the 4-velocity of B is {'J w ,'y w 'w), and the 4-velocity of C is (1,0). 

The invariance of the inner product implies 


{lu,luU) ■ (7u,-7w«) 

=> 7«7t>(l + uv ) 



{iwilwW) • ( 1 , 0 ) 


7w 



(12.32) 


Squaring and then solving for w gives 


(12.33) 


2. Relative speed 

In the lab frame, the 4-velocities of the particles are (suppressing the z component) 
( 7 oj 7 vfcos#, 7 „usin#) and ( 7 „, 7 „u cos #, — j v v sin#). (12.34) 


Let w be the desired speed of one particle as viewed by the other. Then in the frame 
of one particle, the 4-velocities are (suppressing two spatial components) 

( 7 u)i 7 u;w) and (1,0), (12.35) 

where we have rotated the axes so that the relative motion is along the rr-axis in this 
frame. Since the 4-vector inner product is invariant under Lorentz transformations 
and rotations, we have (using cos 2 # = cos 2 # — sin 2 #) 

(7i),7j)^cos#,7 v nsin#) • (7„,7„ncos#, —7 w nsin#) = (j w , j w w) ■ (1,0) 

=>■ 7 2 (1 — v 2 cos 2#) = Tu,. (12.36) 


Using the definitions of the 7 ’s, squaring, and solving for w gives 

I (1 _ w 2)2 y/lv 2 (1 - cos 29) - v A sin 2 2 # 

U y (1 — v 2 cos2#) 2 1 - p|ip^# 

If desired, this can be rewritten (using some double-angle formulas) in the form, 
_ 2 rsin#\/l - u 2 cos 2 # 


(12.37) 


1-v 2 cos 2 # 


(12.38) 


Remark: If 2# = 180°, then w = 2v/(l + v 2 ), in agreement with the standard velocity- 
addition formula. And if 9 = 0°, then w = 0, as should be the case. If 0 is very small, then 
you can show w « 2v sin 0/V% which is simply the relative speed in the lab frame, 
multiplied by the time dilation factor between the frames. (The particles’ clocks run slow, 
and transverse distances don’t change, so the relative speed is larger in a particle’s frame.) 
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3. Another relative speed 

In the lab frame, the 4-velocities of the particles are (suppressing the z component) 
Va = (7u,7u«,0) and Vb = (7 v ,7vVcos0,—j v vsm0). (12.39) 


Let w be the desired speed of one particle as viewed by the other. Then in the frame 
of one particle, the 4-velocities are (suppressing two spatial components) 

(' 1 w,lwW ) and (1,0), (12.40) 

where we have rotated the axes so that the relative motion is along the x-axis in this 
frame. Since the 4-vector inner product is invariant under Lorentz transformations 
and rotations, we have 


( 7 u> 7 «w,0) • ( 7 „, 7 „vcos$, — 7 „usin$) = ( 7 «,, 7 u,tw) ■ (1,0) 

=>■ 7 u 7 v(l — uncos 6) = j w . 

Using the definitions of the 7 ’s, squaring, and solving for w gives 

(1 — u 2 )(l — v 2 ) \fu 2 + v 2 — 2 uv cos 0 — u 2 v 2 sin 2 6 
(1 — uvcosO) 2 1 — uncos 9 

You can check various special cases of this result. 

4. Acceleration for linear motion 



(12.41) 


(12.42) 


(a) Using v(r) = tanh (ar), we have 7 = = cosh(ar). Therefore, 

V = ( 7 , 7 U) = (cosh(ar),sinh(aT)), (12.43) 

where we have suppressed the two transverse components of V. We then have 
A = = a(sinh(ar),cosh(ar)). (12.44) 

(b) The spaceship is at rest in its instantaneous inertial frame, so 

V' = (1,0). (12.45) 

In the rest frame, we also have 

A' = (0,a). (12.46) 

Equivalently, these are obtained by setting r = 0 in the results from part (a), 
because the spaceship hasn’t started moving at r = 0 , as is always the case in 
the instantaneous rest frame. 

(c) The Lorentz transformation matrix from S' to S is 

M = ( 1 1V ) = ( CO t!“ T ! Sin b“ T l ) . (12.47) 

\ l v 1 J V sin h(aT) cosh(ar) J 

We must check that 

v t )= M (v;) “ d <1248) 


These are easily seen to be true. 
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Chapter 13 

General Relativity 

Copyright 2004 by David Morin, morin@physics.harvard.edu 

This will be somewhat of a strange chapter, because we won’t have enough time to 
get to the heart of General Relativity (GR). But we will still be able to get a flavor 
of the subject and derive a few interesting GR results. 

One crucial idea in GR is the Equivalence Principle. This basically says that 
gravity is equivalent to acceleration. We will have much to say about this issue in the 
sections below. Another crucial concept in GR is that of coordinate independence. 
The laws of physics should not depend on what coordinate system you choose. This 
seemingly innocuous statement has surprisingly far-reaching consequences. How¬ 
ever, a discussion of this topic is one of the many things we won’t have time for. We 
would need a whole class on GR to do it justice. But fortunately, it is possible to 
get a sense of the nature of GR without having to master such things. This is the 
route we will take in this chapter. 

13.1 The Equivalence Principle 

Einstein’s Equivalence Principle says that it is impossible to locally distinguish be¬ 
tween gravity and acceleration. This may be stated more precisely in (at least) three 
ways. 


• Let person A be enclosed in a small box, far from any massive objects, that 
undergoes uniform acceleration (say, g ). Let person B stand at rest on the 
earth (see Fig. 13.1). The Equivalence Principle says that there are no local 
experiments these two people can perform that will tell them which of the two 
settings they are in. The physics of each setting is the same. 

• Let person A be enclosed in a small box that is in free-fall near a planet. 
Let person B float freely in space, far away from any massive objects (see 
Fig. 13.2). The Equivalence Principle says that there are no local experiments 
these two people can perform that will tell them which of the two settings they 
are in. The physics of each setting is the same. 

• “Gravitational” mass is equal to (or proportional to) “inertial” mass. Gravi¬ 
tational mass is the m g that appears in the formula, F = GMm g /r 2 = m g g. 

XIII-1 



Figure 13.1 





Figure 13.2 








XIII-2 


CHAPTER 13. GENERAL RELATIVITY 


Inertial mass is the ra* that appears in the formula, F = m^a. There is no 
a priori reason why these two m 's should be the same (or proportional). An 
object that is dropped on the earth will have acceleration a = (m g firing. For 
all we know, the ratio m g /mi for plutonium is different from that for copper. 
But experiments with various materials have detected no difference in the ra¬ 
tios. The Equivalence Principle states that the ratios are equal for any type 
of mass. 

This definition of the Equivalence Principle is equivalent to, say, the second 
one above for the following reason. Two different masses near B will stay right 
where they are. But two different masses near A will diverge from each other 
if their accelerations are not equal. 

These statements are all quite believable. Consider the first one, for example. 
When standing on the earth, you have to keep your legs firm to avoid falling down. 
When standing in the accelerating box, you have to keep your legs firm to maintain 
the same position relative to the floor (that is, to avoid “falling down”). You cer¬ 
tainly can’t naively tell the difference between the two scenarios. The Equivalence 
Principle says that it’s not just that you’re too inept to figure out a way to differen¬ 
tiate between them, but instead that there is no possible local experiment you can 
perform to tell the difference, no matter how clever you are. 

Remark: Note the inclusion of the words “small box” and “local” above. On the 
surface of the earth, the lines of the gravitational force are not parallel; they converge to the 
center. The gravitational force also varies with height. Therefore, an experiment performed 
over a non-negligible distance (for example, dropping two balls next to each other, and 
watching them converge; or dropping two balls on top of each other and watching them 
diverge) will have different results from the same experiment in the accelerating box. The 
equivalence principle says that if your laboratory is small enough, or if the gravitational 
field is sufficiently uniform, then the two scenarios look essentially the same. Jit 


13.2 Time dilation 

The equivalence principle has a striking consequence concerning the behavior of 
clocks in a gravitational field. It implies that higher clocks run faster than lower 
clocks. If you put a watch on top of a tower, and then stand on the ground, you will 
see the watch on the tower tick faster than an identical watch on your wrist. When 
you take the watch down and compare it to the one on your wrist, it will show more 
time elapsed. 1 Likewise, someone standing on top of the tower will see a clock on 
the ground run slow. Let’s be quantitative about this. Consider the following two 
scenarios. 

1 This will be true only if you keep the watch on the tower for a long enough time, because 
the movement of the watch will cause it to run slow due to the usual special-relativistic time 
dilation. But the (speeding-up) effect due to the height can be made arbitrarily large compared 
to the (slowing-down) effect due to the motion, by simply keeping the watch on the tower for an 
arbitrarily long time. 
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• A light source on top of a tower of height h emits flashes at time intervals t s . A 
receiver on the ground receives the flashes at time intervals t r (see Fig. 13.3). 
What is t r in terms of t s ? 

• A rocket of length h accelerates with acceleration g. A light source at the front 
end emits flashes at time intervals t s . A receiver at the back end receives the 
flashes at time intervals t r (see Fig. 13.4). What is t r in terms of t s ? 


The equivalence principle tells us that these two scenarios look exactly the same, 
as far as the sources and receivers are concerned. Hence, the relation between t r and 
t s is the same in each. Therefore, to find out what is going on in the first scenario, 
we will study the second scenario (because we can figure out how this one behaves). 

Consider an instantaneous inertial frame, S, of the rocket. In this frame, the 
rocket is momentarily at rest (at, say, t = 0), and then it accelerates out of the 
frame with acceleration g. The following discussion will be made with respect to 
the frame S. 

Consider a series of quick light pulses emitted from the source, starting at t = 0. 
The distance the rocket has traveled out of S at time t is gt 2 /2, so if we assume 
that t s is very small, then we may say that many light pulses are emitted before the 
rocket moves appreciably. Likewise, the speed of the source, namely gt, is also very 
small. We may therefore ignore the motion of the rocket, as far as the light source 
is concerned. 

However, the light takes a finite time to reach the receiver, and by then the 
receiver will be moving. We therefore cannot ignore the motion of the rocket when 
dealing with the receiver. The time it takes the light to reach the receiver is h/c, 
at which point the receiver has a speed of v = g(h/c). 2 Therefore, by the usual 
classical Doppler effect, the time between the received pulses is 3 


t r 


t s 

1 +( v/c )' 


(13.1) 


Therefore, the frequencies, f r = l/t r and f s = 1 / t s , are related by 


fr= 1 + - /.= 


1 + h 2 ) fs- 


(13.2) 


Returning to the clock-on-tower scenario, we see (using the equivalence principle) 
that an observer on the ground will see the clock on the tower running fast, by a 
factor 1 + gh/(?. This means that the upper clock really is running fast, compared 

2 The receiver moves a tiny bit during this time, so the “ft” here should really be replaced by 
a slightly smaller distance. But this yields a negligible second-order effect in the small quantity 
gh/c 2 , as you can show. To sum up, the displacement of the source, the speed of the source, and 
the displacement of the receiver are all negligible. But the speed of the receiver is quite relevant. 

3 Quick proof of the classical Doppler effect: As seen in frame S, when the receiver and a 
particular pulse meet, the next pulse is a distance ct s behind. The receiver and this next pulse 
then travel toward each other at relative speed c + v (as measured by someone in S). The time 
difference between receptions is therefore t T = ct s /(c + v). 



Figure 13.3 
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to the lower clock. 4 That is, 

At h = (l + ^j A t 0 . (13.3) 

A twin from Denver will be older than his twin from Boston when they meet up at 
a family reunion (all other things being equal, of course). 

Greetings! Dear brother from Boulder, 

I hear that you’ve gotten much older. 

And please tell me why 

My lower left thigh 

Hasn’t aged quite as much as my shoulder! 

Note that the gh in eq. (13.3) is the gravitational potential energy, divided by m. 

Remark: You might object to the above derivation, because t r is the time measured by 
someone in the inertial frame, S. And since the receiver is eventually moving with respect 
to S, we should multiply the f r in eq. (13.2) by the usual special-relativistic time-dilation 
factor, l/J% — (v/c) 2 (because the receiver’s clocks are running slow relative to S, so the 
frequency measured by the receiver is greater than that measured in S). However, this is 
a second-order effect in the small quantity v/c = gh/c 2 . We already dropped other effects 
of the same order, so we have no right to keep this one. Of course, if the leading effect in 
our final answer was second-order in v/c, then we would know that our answer was garbage. 
But the leading effect happens to be first order, so we can afford to be careless with the 
second-order effects. 

After a finite time has passed, the frame S will no longer be of any use to us. 
But we can always pick a new instantaneous rest frame of the rocket, so we can 
repeat the above analysis at any later time. Hence, the result in eq. (13.2) holds at 
all times. 

This GR time-dilation effect was first measured at Harvard by Pound and Rebka 
in 1960. They sent gamma rays up a 20 m tower and measured the redshift (that is, 
the decrease in frequency) at the top. This was a notable feat indeed, considering 
that they were able to measure a frequency shift of gh/c 2 (which is only a few parts 
in 10 15 ) to within 1% accuracy. 


13.3 Uniformly accelerated frame 

Before reading this section, you should think carefully about the “Break or not 
break” problem (Problem 25) in Chapter 10. Don’t look at the solution too soon, 
because chances are you will change your answer after a few more minutes of thought. 
This is a classic problem, so don’t waste it by peeking! 

4 Unlike the situation where two people fly past each other (as with the usual twin paradox), we 
can say here that what an observer sees is also what actually is. We can say this because everyone 
here is in the same frame. The “turnaround” effect that was present in the twin paradox is not 
present now. The two clocks can be slowly moved together without anything exciting or drastic 
happening to their readings. 
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Technically, the uniformly accelerated frame we will construct has nothing to do 
with GR. We will not need to leave the realm of special relativity for the analysis in 
this section. The reason we choose to study this special-relativistic setup in detail 
is because it shows many similarities to genuine GR situations, such as black holes. 


13.3.1 Uniformly accelerated point particle 


In order to understand a uniformly accelerated frame, we first need to understand 
a uniformly accelerated point particle. In Section 10.7, we briefly discussed the 
motion of a uniformly accelerated particle, that is, one that feels a constant force in 
its instantaneous rest frame. Let us now take a closer look at such a particle. 

Let the particle’s instantaneous rest frame be S', and let it start from rest in 
the inertial frame S. Let its mass be m. We know from Section 11.5.3 that the 
longitudinal force is the same in the two frames. Therefore, since it is constant in 
frame S', it is also constant in frame S. Call it /. For convenience, let g = f /m (so 
g is the proper acceleration felt by the particle). Then in frame S we have, using 
the fact that / is constant, 


_ dp _ d{m'yv) 
dt dt 


jv = gt 


fft 

v / r+W’ 


(13.4) 


where we have set c = 1. As a double-check, this has the correct behavior for t —> 0 
and t — * oo. If you want to keep the c’s in, then (gt) 2 becomes ( gt/c ) 2 , to make the 
units correct. 

Having found the speed in frame S at time t. the position in frame S at time t 
is given by 

x= C vdt= C vmn? = « + tet)2 ~ 0 - (13 - 5) 

For convenience, let P be the point (see Fig. 13.5) 

(x P ,t P ) = (-1/5,0). (13.6) 

Then eq. (13.5) yields 

(x - x P ) 2 - t 2 = ^ • (13.7) 

This is the equation for a hyperbola with its center (defined as the intersection of 
the asymptotes) at point P. For a large acceleration g, the point P is very close to 
the particle’s starting point. For a small acceleration, it is far away. 

Everything has been fairly normal up to this point, but now the fun begins. 
Consider a point A on the particle’s worldline at time t. From eq. (13.5), A has 
coordinates 

(au, = Q (\A + ( 9t ) 2 - l), i) ■ (13.8) 

The slope of the line PA is therefore 


t A — t P _ gt 
x A ~ x P i/l + (gt) 2 ’ 


hyperbola 



P hg| 

Figure 13.5 


(13.9) 
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Looking at eq. (13.4), we see that this slope equals the speed of the particle at 
point A. But we know very well that the speed v is the slope of the particle’s 
instantaneous a/-axis; see eq. (10.44). Therefore, the line PA and the particle’s 
x'-axis are the same line. This holds for any arbitrary time, t. So we may say 
that at any point along the particle’s worldline, the line PA is the instantaneous 
x'-axis of the particle. Or, said another way, no matter where the particle is, the 
event at P is simultaneous with an event located at the particle, as measured in the 
instantaneous frame of the particle. In other words, the particle always says that P 
happens “now.” 5 

Here is another strange fact. What is the distance from P to A. as measured in 
an instantaneous rest frame, S', of the particle? The 7 factor between frames S and 
S' is, using eq. (13.4), 7 = \J\ + (gt) 2 . The distance between P and A in frame S 
is x A — x P = \/l + ( gt ) 2 /g. So the distance between P and A in frame S' is (using 
the Lorentz transformation Ax = j(Ax' + vAt’), with At' = 0) 

x' A — x' p = —(x A — x P ) = - . (13.10) 

7 9 

This is independent of t\ Therefore, not only do we find that P is always simultane¬ 
ous with the particle, in the particle’s frame; we also find that P always remains the 
same distance (namely 1/g) away from the particle, as measured in the particle’s 
instantaneous rest-frame. This is rather strange. The particle accelerates away from 
point P, but it does not get further away from it (in its own frame). 

Remark: We can give a continuity argument that shows that such a point P must 
exist. If P is close to you, and if you accelerate away from it, then of course you get farther 
away from it. Everyday experience is quite valid here. But if P is sufficiently far away from 
you, and if you accelerate away from it, then the at 2 /2 distance you travel away from it can 
easily be compensated by the decrease in distance due to length contraction (brought about 
by your newly acquired velocity). This effect grows with distance, so we simply need to pick 
P to be sufficiently far away. What this means is that every time you get out of your chair 
and walk to the door, there are stars very far away behind you that get closer to you as you 
walk away from them (as measured in your instantaneous rest frame). By continuity, then, 
there must exist a point P that remains the same distance from you as you accelerate away 
from it. 4k 


13.3.2 Uniformly accelerated frame 

Let’s now put a collection of uniformly accelerated particles together to make a 
uniformly accelerated frame. The goal will be to create a frame where the distances 
between particles (as measured in any particle’s instantaneous rest frame) remain 
constant. 

Why is this our goal? We know from the “Break or not break” problem in 
Chapter 10 that if all the particles accelerate with the same proper acceleration, 

s The point P is very much like the event horizon of a black hole. Time seems to stand still at 
P. And if we went more deeply into GR, we would find that time seems to stand still at the edge 
of a black hole, too (as viewed by someone farther away). 







13.3. UNIFORMLY ACCELERATED FRAME 


XIII-7 


g, then the distances (as measured in a particle’s instantaneous rest frame) grow 
larger. While this is a perfectly possible frame to construct, it is not desirable here 
for the following reason. Einstein’s Equivalence Principle states that an accelerated 
frame is equivalent to a frame sitting on, say, the earth. We may therefore study 
the effects of gravity by studying an accelerated frame. But if we want this frame 
to look anything like the surface of the earth, we certainly can’t have distances that 
change over time. 

We therefore want to construct a static frame, that is, one where distances do 
not change (as measured in the frame). This will allow us to say that if we enclose 
the frame by windowless walls, then for all a person inside knows, he is standing 
motionless in a static gravitational field (which has a certain definite form, as we 
shall see). 

Let’s figure out how to construct the frame. We’ll discuss only the acceleration 
of two particles here. Others can be added in an obvious manner. In the end, the 
desired frame as a whole is constructed by accelerating each atom in the floor of the 
frame with a specific proper acceleration. 

From the previous subsection, we already have a particle A which is “centered” 
around the point P. 6 We claim (for reasons that will become clear) that every other 
particle in the frame should also be “centered” around the same point P. 

Consider another particle, B. Let a and b be the initial distances from P to A 
and B. If both particles are to be centered around P, then their proper accelerations 
must be, from eq. (13.6), 

9 a - 1 - and g B = \. (13.11) 

a b 

Therefore, in order to have all points in the frame be centered around P, we sim¬ 
ply have to make their proper accelerations inversely proportional to their initial 
distances from P. 

Why do we want every particle to be centered around P? Consider two events, 
Ea and Eb, such that P, Ea , and Eb are collinear in Fig. 13.6. Due to construction, 
the line PEaEb is the x'-axis for both particle A and particle B, at the positions 
shown. From the previous subsection, we know that A is always a distance a from 
P, and B is always a distance b from P. Combining these facts with the fact that 
A and B measure their distances along the P-axis of the same frame (at the events 
shown in the figure), we see that both A and B measure the distance between them 
to be b — a. This is independent of t, so A and B measure a constant distance 
between them. We have therefore constructed our desired static frame. This frame 
is often called a “Rindler space.” 

If a person walks around in the frame, he will think he lives in a static world where 
the acceleration due to gravity takes the form g(z) oc 1/z. where z is the distance 
to a certain magical point which is located at the end of the known “universe”. 

What if a person releases himself from the accelerating frame, so that he forever 
sails through space at constant speed? He thinks he is falling, and you should 

6 This will be our shorthand notation for “traveling along a hyperbola whose center is the point 
P.” 



Figure 13.6 
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convince yourself that he passes by the “magical point” in a finite proper time. But 
his friends who are still in the frame will see him take an infinitely long time to get 
to the “magical point” P. This is similar to the situation with a black hole. An 
outside observer will see it take an infinitely long time for a falling person to reach 
the “boundary” of a black hole, even though it will take a finite proper time for the 
person. 

Our analysis shows that A and B feel a different proper acceleration, because 
a ^ b. There is no way to construct a static frame where all points feel the same 
proper acceleration, so it is impossible to mimic a constant gravitational field (over 
a finite distance) by using an accelerated frame. 

13.4 Maximal-proper-time principle 

The maximal-proper-time principle in General Relativity says: Given two events in 
spacetime, a particle under the influence of only gravity takes the path in spacetime 
that maximizes the proper time. For example, if you throw a ball from given coor¬ 
dinates (xi,ti), and it lands at given coordinates (x 2 , t‘ 2 ). then the claim is that the 
ball takes the path that maximizes its proper time. 7 

This is clear for a freely-moving ball in outer space, far from any massive objects. 
The ball travels at constant speed from one point to another, and we know that this 
constant-speed motion is the motion with the maximal proper time. This is true 
because a ball (A) moving at constant speed would see the clock on any other ball 
(B) slowed down due to the special-relativistic time dilation, if there were a relative 
speed between them. (It is assumed here that R’s non-uniform velocity is caused 
by a non-gravitational force acting on it.) B would therefore show a shorter elapsed 
time. This argument does not work the other way around, because B is not in an 
inertial frame and therefore cannot use the special-relativistic time-dilation result. 

Consistency with Newtonian physics 

The maximal-proper-time principle sounds like a plausible idea, but we already know 
from Chapter 5 that the path an object takes is the one that yields a stationary 
value of the classical action, f(T — V). We must therefore demonstrate that the 
“maximal”-proper-time principle reduces to the stationary-action principle, in the 
limit of small velocities. If this were not the case, then we’d have to throw out our 
theory of gravitation. 

Consider a ball thrown vertically on the earth. Assume that the initial and final 
coordinates are fixed to be and (2/2^2)• Our plan will be to assume that 

the maximal-proper-time principle holds, and to then show that this leads to the 
stationary-act ion principle. 

7 The principle is actually the “stationary-proper-time principle.” As with the Lagrangian for¬ 
malism in Chapter 5, any type of stationary point (a maximum, minimum, or saddle point) is 
allowed. But although we were very careful about stating things properly in Chapter 5, we’ll be a 
little sloppy here and just use the word “maximum,” because that’s what it will generally turn out 
to be in the situations we will look at. However, see Problem 8. 
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Before being quantitative, let’s get a qualitative handle on what’s going on with 
the ball. There are two competing effects, as far as maximizing the proper time 
goes. On one hand, the ball wants to climb very high, because its clock will run 
faster there (due to the GR time dilation). But on the other hand, if it climbs very 
high, then it must move very fast to get there (because the total time, t 2 — ti, is 
fixed), and this will make its clock run slow (due to the SR time dilation). So there 
is a tradeoff. Let’s now look quantitatively at the implications of this tradeoff. 

The goal is to maximize 



(13.12) 


Due to the motion of the ball, we have the usual time dilation, dr = \/l — v 2 /c 2 dt. 
But due to the height of the ball, we also have the gravitational time dilation, 
dr = (1 + gy/(?)dt. Combining these effects gives 8 


dr = 



(13.13) 


Using the Taylor expansion for \|| —• e, and dropping terms of order 1/c 4 and smaller, 
we see that we want to maximize 


I> k £ 
- r 

Jt i 



(13.14) 


The “1” term gives a constant, so maximizing this integral is the same as minimizing 



which is the classical action, as desired. For a one-dimensional gravitational problem 
such as this one, the action will always be a minimum, and the proper time will 
always be a maximum, as you can show by considering the second-order change in 
the action (see Exercise 10). 

In retrospect, it is not surprising that this all works out. The factor of 1/2 in 
the kinetic energy here comes about in exactly the same way as in the derivation 
in eq. (11.9), where we showed that the relativistic form of energy reduces to the 
familiar Newtonian expression. 


13.5 Twin paradox revisited 

Let’s take another look at the standard twin paradox, this time from the perspective 
of General Relativity. We should emphasize that GR is by no means necessary for an 

8 This result is technically not correct; the two effects are intertwined in a somewhat more 
complicated way (see Exercise 7). But it is valid up to order v 2 /c 2 , which is all we are concerned 
with, since we are assuming ti«c. 
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understanding of the original formulation of the paradox (the first scenario below). 
We were able to solve it in Section 10.2.2, after all. The present discussion is given 
simply to show that the answer to an alternative formulation (the second scenario 
below) is consistent with what we’ve learned about GR. Consider the two following 
twin-paradox setups. 

• Twin A floats freely in outer space. Twin B flies past A in a spaceship, with 
speed vo (see Fig. 13.7). At the instant they are next to each other, they both 
set their clocks to zero. At this same instant, B turns on the reverse thrusters 
of his spaceship and decelerates with proper deceleration g. B eventually 
reaches a farthest point from A and then accelerates back toward A, finally 
passing him with speed vo again. When they are next to each other, they 
compare the readings on their clocks. Which twin is younger? 

• Twin B stands on the earth. Twin A is thrown upward with speed vo (let’s 
say he is fired from a cannon in a hole in the ground). See Fig. 13.8. At the 
instant they are next to each other, they both set their clocks to zero. A rises 
up and then falls back down, finally passing B with speed vo again. When 
they are next to each other, they compare the readings on their clocks. Which 
twin is younger? 

The first scenario is easily solved using special relativity. Since A is in an inertial 
frame, he may apply the results of special relativity. In particular, A sees B’ s clock 
run slow, due to the usual special-relativistic time dilation. Therefore, B ends up 
younger at the end. Note that B cannot use the reverse reasoning, because she is 
not in an inertial frame. 

What about the second scenario? The key point to realize is that the Equivalence 
Principle says that these two scenarios are exactly the same , as far as the twins are 
concerned. Twin B has no way of telling whether she is in a spaceship accelerating 
at g or on the surface of the earth. And A has no way of telling whether he is 
floating freely in outer space or in free-fall in a gravitational field. 9 We therefore 
conclude that B must be younger in the second scenario, too. 

At first glance, this seems incorrect, because in the second scenario, B is sitting 
motionless, while A is the one who is moving. It seems that B should see A’s 
clock running slow, due to the usual special-relativistic time dilation, and hence A 
should be younger. This reasoning is incorrect because it fails to take into account 
the gravitational time dilation. The fact of the matter is that A is higher in the 
gravitational field, and therefore his clock runs faster. This effect does indeed win 
out over the special-relativistic time dilation, and A ends up older. You can explicitly 
show this in Problem 11. 

Note that the reasoning in this section is another way to conclude that the 
Equivalence Principle implies that higher clocks must run faster (in one way or 

9 This fact is made possible by the equivalence of inertial and gravitational mass. Were it not 
for this, different parts of A’s body would accelerate at different rates in the gravitational field in 
the second scenario. This would certainly clue him in to the fact that he was not floating freely in 
space. 
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another). The Equivalence Principle implies that A must be older in the second 
scenario, which means that there must be some height effect that makes A’s clock 
run fast (fast enough to win out over the special-relativistic time dilation). But it 
takes some more work to show that the factor is actually 1 + gh/c 2 . 

Also note that the fact that A is older is consistent with the maximal-proper-time 
principle. In both scenarios, A is under the influence of only gravity (zero gravity in 
the first scenario), whereas B feels a normal force from either the spaceship’s floor 
or the ground. 
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13.6 Exercises 

Section 13.1: The Equivalence Principle 

1. Driving on a hill 

You drive up and down a hill of height h at constant speed. What should your 
speed be so that you age the same amount as someone standing at the base 
of the hill? Assume that the hill is in the shape of an isosceles triangle with 
altitude h. 

2. Lv/c 2 and gh/c 2 * 

The familiar special-relativistic “head-start” result, Lv/c 2 , looks rather similar 
to the gh/c 2 term in the GR time-dilation result, eq. (13.3). Imagine standing 
at the front of a train of length L. For small v, devise a thought experiment 
that explains how the Lv/<? result follows from the gh/c 2 result. 

3. Opposite circular motion **** 

A and B move at speed v (v <C c) in opposite directions around a circle of 
radius r (so they pass each other after each half-revolution). They both see 
their clocks ticking at the same rate. Show this in three ways. Work in: 

(a) The lab frame (the inertial frame whose origin is the center of the circle). 

(b) The frame whose origin is B and whose axes remain parallel to an inertial 
set of axes. 

(c) The rotating frame that is centered at the origin and rotates along with 
B. 

Hints: See Problem 4, which is similar, although easier. And take a look at 
the Einstein limerick in Section 9.2. 

Section 13.2: Uniformly accelerated frame 

4. Using rapidity * 

Another way to derive the v in eq. (13.4) is to use the v = tanh(gr) rapidity 
result (where r is the particle’s proper time) from Section 10.7. Use time 
dilation to show that this implies gt = sinh(gr), and hence eq. (13.4). 

5. Various quantities * 

A particle starts at rest and accelerates with proper acceleration g. Let r be 
the time on the particle’s clock. Using the v from eq. (13.4), use time dilation 
to show that the time t in the original inertial frame, the speed of the particle, 
and the associated 7 factor are given by (with c = 1) 

gt = sinh(gr), v = tanh(yr), 7 = cosh(gr). (13.16) 




13.6. EXERCISES 


XIII-13 


6. Redshift ** 

We found in Section 13.2 that a clock at the back of a rocket will see a clock 
at the front run fast by a factor 1 + gh/c 2 . However, we ignored higher-order 
effects in 1/c 2 , so for all we know, the factor is actually, say, e gh ^ c , or perhaps 
In (gh/c 2 ), and we found only the first term in the Taylor series. 

(a) For the uniformly accelerated frame in Section 13.3.2, show that the factor 
is in fact exactly 1 + gbh/c 2 , where g\ •, is the acceleration of the back of 
the rocket. Show this by lining up a series of clocks and looking at the 
successive factors between them. 

(b) By the same reasoning, it follows that the front clock sees the back clock 
running slow by a factor 1 — gfh/c 2 , where gf is the acceleration of the 
front. Show explicitly that (1 + <7b/c 2 )(l — gfh/c 2 ) = 1, as must be the 
case, because a clock can’t gain time with respect to itself. 

7. Gravity and speed combined ** 

Use a Minkowski diagram to do this problem (in the spirit of Problem 10.24, 
“Acceleration and redshift”). 

A rocket accelerates with proper acceleration g toward a planet. As measured 
in the instantaneous inertial frame of the rocket, the planet is a distance x 
away and moves at speed v. Everything is in one dimension here. 

As measured in the accelerating frame of the rocket, show that the planet’s 
clock runs at a rate (with c = 1), 

dtp = dt r ( 1 + gx)V 1 — v 2 . (13.17) 

And show that the planet’s speed is 

V=(l + gx)v. (13.18) 

Note that if we combine these two results to eliminate v, and if we then invoke 
the equivalence principle, we arrive at the result that a clock moving at height 
h and speed V in a gravitational field is seen by someone on the ground to 
run at a rate (putting the c’s back in), 



(13.19) 


8. Speed in accelerating frame * 

In the setup in Problem 6, use eq. (13.20) to find the speed of the planet, 
dx/dr, as a function of r. What is the maximum value of this speed, in terms 
of g and the initial distance, LI 
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9. Accelerating stick’s length ** 

Consider a uniformly accelerated frame consisting of a stick, the ends of which 
have worldlines given by the curves in Fig. 13.6 (so the stick has proper 
length b — a). At time t in the lab frame, we know that a point that undergoes 
acceleration g has position yfl + (gt) 2 /g relative to the point P in Fig. 13.6. 
An observer in the original inertial frame will see the stick being length- 
contracted by different factors along its length, because different points move 
with different speeds (at a given time in the original frame). Show, by doing 
the appropriate integral, that this observer will conclude that the stick always 
has proper length b — a. 

Section 13.3: Maximal proper-time principle 

10. Maximum proper time * 

Show that the extremum of the gravitational action in eq. (13.15) is always a 
minimum. Do this considering a function, y(t ) = yo(t) + £(f), where yo is the 
path that extremizes the action, and £ is a small variation. 

Section 13.4-' Twin paradox revisited 

11. Symmetric twin non-paradox ** 

Two twins travel in opposite directions at speed v (v <C c) with respect to the 
earth. They synchronize their clocks when they pass each other. They travel 
to stars located a distance £ away, and then decelerate and accelerate back up 
to speed v in the opposite direction (uniformly, and in a short time compared 
to the total journey time). 

In the frame of the earth, it is obvious (from symmetry) that both twins age 
the same amount by the time they pass each other again. Reproduce this 
result by working in the frame of one of the twins. 
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13.7 Problems 

Section 13.1: The Equivalence Principle 

1. Airplane’s speed 

A plane flies at constant height h. What should its speed be so that an observer 
on the ground sees the plane’s clock tick at the same rate as a ground clock? 
(Assume v « c.) 

2. Clock on tower ** 

A clock starts on the ground and then moves up a tower at constant speed v. 
It sits on top of the tower for a time T and then descends at constant speed 
v. If the tower has height h, how long should the clock sit at the top so that 
it comes back showing the same time as a clock that remained on the ground? 
(Assume v « c.) 

3. Circular motion ** 

Person B moves at speed v (with v <C c) in a circle of radius r around person 
A. By what fraction does B' s clock run slower than As? Calculate this in 
three ways. Work in: 

(a) As frame. 

(b) The frame whose origin is B and whose axes remain parallel to an inertial 
set of axes. 

(c) The rotating frame that is centered at A and rotates around A with the 
same frequency as B. 

4. More circular motion ** 

A and B move at speed v (v « c) in a circle of radius r, at diametrically 
opposite points. They both see their clocks ticking at the same rate. Show 
this in three ways. Work in: 

(a) The lab frame (the inertial frame whose origin is the center of the circle). 

(b) The frame whose origin is B and whose axes remain parallel to an inertial 
set of axes. 

(c) The rotating frame that is centered at the origin and rotates with the 
same frequency as A and B. 

Section 13.2: Uniformly accelerated frame 

5. Getting way ahead **** 

A rocket with proper length L accelerates from rest, with proper acceleration 
g (where gL <C c 2 ). Clocks are located at the front and back of the rocket. If 
we look at this setup in the frame of the rocket, then the general-relativistic 
time-dilation effect tells us that the times on the two clocks are related by 
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tf = (1 + gL/( 2 )t b . Therefore, if we look at things in the ground frame, then 
the times on the two clocks are related by 


tf = t b 


giA 

" c*J 


Lv 


where the last term is the standard special-relativistic “head-start” result. 
Derive the above relation by working entirely in the ground frame. 10 


6. Accelerator’s point of view *** 

A rocket starts at rest relative to a planet, a distance £ away. It accelerates 
toward the planet with proper acceleration g. Let r and t be the readings on 
the rocket’s and planet’s clocks, respectively. 


(a) 


Show that when the astronaut’s clock reads r, he observes the rocket- 
planet distance, x (as measured in his instantaneous inertial frame), to 
be given by 


1 +gx = 


1 + g£ 

cosh(gr) 


(13.20) 


(b) Show that when the astronaut’s clock reads r, he observes the time, t, 
on the planet’s clock to be given by 


gt = (1 + g£) tanh(gr). (13.21) 


The results from Exercises 5 and 7 will be useful here. 


7. Z/u/c 2 revisited ** 

You stand at rest relative to a rocket that has synchronized clocks at its ends. 
It is then arranged for you and the rocket to move with relative speed v. A 
reasonable question to now ask is: As viewed by you, what is the difference in 
readings on the clocks located at the ends of the rocket? 

It turns out that this question cannot be answered without further information 
on how you and the rocket got to be moving with relative speed v. There are 
two basic ways this relative speed can come about. The rocket can accelerate 
while you sit there, or you can accelerate while the rocket sits there. Using 
the results from Problems 5 and 6, explain what the answers to the above 
question are in these two cases. 

8. Circling the earth ** 

Clock A sits at rest on the earth, and clock B circles the earth in an orbit that 
skims along the ground. Both A and B are essentially at the same radius, so 

10 You may find this relation surprising, because it implies that the front clock will eventually 
be an arbitrarily large time ahead of the back clock, in the ground frame. (The subtractive Lv/c? 
term is bounded by L/c and will therefore eventually become negligible compared to the additive, 
and unbounded, ( gL/c 2 )tb term.) But both clocks are doing basically the same thing relative to 
the ground frame, so how can they eventually differ by so much? Your job is to find out. 
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the GR time-dilation effect yields no difference in their times. But B is moving 
relative to A, so A will see B running slow, due to the usual SR time-dilation 
effect. The orbiting clock, B, will therefore show a smaller elapsed proper time 
each time it passes A. In other words, the clock under the influence of only 
gravity (B) does not show the maximal proper time, in conflict with what we 
have been calling the maximal-proper-time principle. Explain. 

Section 13-4 ■' Twin paradox revisited 

9. Twin paradox * 

A spaceship travels at speed v (v <C c) to a distant star. Upon reaching the 
star, it decelerates and then accelerates back up to speed v in the opposite 
direction (uniformly, and in a short time compared to the total journey time). 
By what fraction does the traveler age less than her twin on earth? (Ignore 
the gravity from the earth.) Work in: 

(a) The earth frame. 

(b) The spaceship frame. 

10. Twin paradox again ** 

(a) Answer the previous problem, except now let the spaceship turn around 
by moving in a small semicircle while maintaining speed v. 

(b) Answer the previous problem, except now let the spaceship turn around 
by moving in an arbitrary manner. The only constraints are that the 
turn-around is done quickly (compared to the total journey time), and 
that it is contained in a small region of space (compared to the earth-star 
distance). 

11. Twin paradox times *** 

(a) In the first scenario in Section 13.5, calculate the ratio of B 's elapsed time 
to A’s, in terms of vq and g. Assume that Vo c, and drop high-order 
terms. 

(b) Do the same for the second scenario in Section 13.5. Do this from scratch 
using the time dilations, and then check that your answer agrees (within 
the accuracy of the calculations) with part (a), as the equivalence prin¬ 
ciple demands. 
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13.8 Solutions 


1. Airplane’s speed 

An observer on the ground sees the plane’s clock run slow by a factor \Jl — v 2 /c 2 due 
to SR time dilation. But he also sees it run fast by a factor (1 + gh/c 2 ) due to GR 
time dilation. We therefore want the product of these two factors to equal 1. Using 
the standard Taylor-series approximation for slow speeds in the first factor, we find 



Neglecting the small 1/c 4 term, and cancelling the l’s, yields v = y/2gh. 
Interestingly, \/2gh is also the answer to a standard question from Newtonian physics, 
namely, how fast must you throw a ball straight up if you want it to reach a height 

m 


2. Clock on tower 


The SR time-dilation factor is yl — v^/c 2 ss 1 — v 2 /2c 2 . The clock therefore loses a 
fraction v 2 /2c 2 of the time elapsed during its motion up and down the tower. The 
upward journey takes a time h/v, and likewise for the downward trip, so the time loss 
due to the SR effect is 


(£)(?)-?■ 


(13.23) 


Our goal is to balance this time loss with the time gain due to the GR time-dilation 
effect. If the clock sits on top of the tower for a time T, then the time gain is 


(13.24) 


But we must not forget also the increase in time due to the height gained while the 
clock is in motion. During its motion, the clock’s average height is h/2. The total 
time in motion is 2 h/v, so the GR time gain while the clock is moving is 


^g{h/2pj ^2h'j = gh 2 


(13.25) 


Setting the total change in the clock’s time equal to zero gives 


vh gh gh 2 


igT-1 


(13.26) 


Therefore, 



(13.27) 


Remarks: Note that we must have v > \fgh- in order for a positive solution for T to exist. 
If v < \fgh, then the SR effect is too small to cancel out the GR effect, even if the clock 
spends no time sitting at the top. If v = \[gh, then T = 0, and we essentially have the same 
situation as in Exercise 1. Note also that if v is very large compared to \fgh (but still small 
compared to c, so that our \J 1 — v 2 /c 2 « 1 — v 2 /2c 2 approximation is valid), then T w v/g, 
which is independent of h. Jit 
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3. Circular motion 

(a) In A’s frame, there is only the SR time-dilation effect. A sees B move at speed 
v, so B’s clock runs slow by a factor of y/l — v 2 /c 2 . And since v -C c, we may 
use the Taylor series to approximate this as 1 -v 2 /2c 2 . 

(b) In this frame, there are both SR and GR time-dilation effects. A moves at speed 
v with respect to B in this f rame, so there is the SR effect that A’s clock runs 
slow by a factor ^/l — v 2 /c 2 « 1 — v 2 /2c 2 . 

But B undergoes an acceleration of a = v 2 /r toward A, so there is also the GR 
effect that A’s clock runs fast by a factor 1 + ar/c 2 = 1 + v 2 /c 2 . 

Multiplying these two effects together, we find (to lowest order) that A’s clock 
runs fast by a factor 1 + v 2 /2c 2 . This means (to lowest order) that B’s clock 
runs slow by a factor 1 — v 2 /2c 2 , in agreement with the answer to part (a). 

(c) In this frame, there is no relative motion between A and B, so there is only 
the GR time-dilation effect. The gravitational field (that is, the centripetal 
acceleration) at a distance x from the center is g x = xu> 2 . Imagine lining up a 
series of clocks along a radius, with separation dx. Then the GR time-dilation 
result tells us that each clock loses a fraction g x dx/(? = xlo 2 dx/c 2 of time 
relative to the clock just inside it. Integrating these fractions from x = 0 to 
x = r shows that B’ s clock loses a fraction r 2 u> 2 /2c 2 = v 2 /2c 2 , compared to A’s 
clock. This agrees with the results in parts (a) and (b). 

4. More circular motion 

(a) In the lab frame, the situation is symmetric with respect to A and B. Therefore, 
if A and B are decelerated in a symmetric manner and brought together, then 
their clocks must read the same time. 

Assume (in the interest of obtaining a contradiction), that A sees B’s clock run 
slow. Then after an arbitrarily long time, A will see B’s clock an arbitrarily 
large time behind his. Now bring A and B to a stop. There is no possible way 
that the stopping motion can make B’s clock gain an arbitrarily large amount of 
time, as seen by A. This is true because everything takes place in a finite region 
of space, so there is an upper bound on the GR time-dilation effect (because 
it behaves like gh/c 2 , and h is bounded). Therefore, A will end up seeing B’s 
clock reading less. This contradicts the result of the previous paragraph. 

Remark: Note how this problem differs from the problem where A and B move with 
equal speeds directly away from each other, and then reverse directions and head back 
to meet up again. 

For this new “linear” problem, the symmetry reasoning in the first paragraph above 
still holds; they will indeed have the same clock readings when they meet up again. 
But the reasoning in the second paragraph does not hold (it better not, because each 
person does not see the other person’s clock running at the same rate). The error is 
that in this linear scenario, the experiment is not contained in a small region of space, 
so the turning-around effects of order gh/c 2 become arbitrarily large as the time of 
travel becomes arbitrarily large, since h grows with time (see Problem 9). X 

(b) In this frame, there are both SR and GR time-dilation effects. A moves at 
speed 2v with respect to B in this frame (we don’t need to use the relativistic 
velocity-addition formula, be cause v c) , so this gives the SR effect that A’s 
clock runs slow by a factor \J\ — (2 v) 2 /c 2 « 1 — 2 v 2 /c 2 . 
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But B undergoes an acceleration of a = v 2 /r toward A, so there is also the GR 
effect that A’s clock runs fast by a factor 1 + a(2r)/c 2 = 1 + 2v 2 /c 2 (because 
they are separated by a distance 2 r). 

Multiplying these two effects together, we find (to lowest order) that the two 
clocks run at the same rate. 

(c) In this frame, there is no relative motion between A and B. Hence, there is 
only the GR effect. But A and B are both at the same gravitational potential, 
because they are at the same radius. Therefore, they both see the clocks running 
at the same rate. 

If you want, you can line up a series of clocks along the diameter between A and 
B, as we did along a radius in part (c) of Problem 3. The clocks will gain time 
as you march in toward the center, and then lose back the same amount of time 
as you march back out to the diametrically opposite point. 


5. Getting way ahead 

The explanation of why the two clocks show different times in the ground frame is the 
following. The rocket becomes increasingly length contracted in the ground frame, 
which means that the front end isn’t traveling as fast as the back end. Therefore, the 
time-dilation factor for the front clock isn’t as large as that for the back clock. So the 
front clock loses less time relative to the ground, and hence ends up ahead of the back 
clock. Of course, it’s not at all obvious that everything works out quantitatively, and 
that the front clock eventually ends up an arbitrarily large time ahead of the back 
clock. In fact, it’s quite surprising that this is the case, because the above difference 
in speeds is rather small. But let’s now show that the above explanation does indeed 
account for the difference in the clock readings. 

Let the back of the ro cket be located at position x. Then the front is located at 
position x + Ls /1 — v 2 (with c = 1), due to the length contraction. Taking the time 
derivatives of the two positions, we see that the speeds of the back and front are (with 
v = dx/dt ) n 

Vb = v, and Vf = v( 1 — Isyv), (13.28) 

For Vb, we will simply invoke the result in eq. (13.4), 


Vb 


gt 

Vl + ( 9t ) 2 ’ 


(13.29) 


where t is the time in the ground frame. 


Having found v, we must now find the y-factors associated with the speeds of the front 
and back of the rocket. The y-factor associated with the speed of the back (namely 
v) is 


76 


= yrrW- 


(13.30) 


The q-factor associated with the speed of the front, Vf = v(l — Lyv), is a little harder 
to obtain. We must first calculate v. From eq. (13.29), we find v = g/(l + g 2 t 2 ) 3 / 2 , 
which gives 


Vf = u(l — Lyv) = 


gt 

Vi + (gt) 2 


gL \ 

1+gH 2 ) 


(13.31) 


11 Since these speeds are not equal, there is of course an ambiguity concerning which speed we 
should use in the length-contraction factor, Vl — « 2 . Equivalently, the rocket actually doesn’t have 
one inertial frame that describes all of it. But you can show that any differences arising from this 
ambiguity are of higher order in gL/c 2 than we need to be concerned with. 
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The 7 -factor (or rather 1 / 7 , which is what we’ll be concerned with) associated with 
this speed can now be found as follows. In the first line below, we ignore the higher- 
order ( gL ) 2 term, because it is really (gL/c 2 ) 2 , and we are assuming that gL/c 2 
is small. And in obtaining the third line, we use the Taylor-series approximation, 
y/T^e ~ 1 — e/ 2 . 


9 2 t 2 , 

1 + g 2 t 2 1 


2 gL \ 

1 + gH 2 ) 


2 gH 2 L 
1 + g 2 t 2 
9 3 t 2 L \ 
1 + gH 2 ) 


(13.32) 


We can now calculate the time that each clock shows, at time t in the ground frame. 
The time on the back clock changes according to dtb = dt/'yb , so eq. (13.30) gives 


(13.33) 


The integral 12 of 1 /a/1 + x 2 is sinh -1 x. Letting x = gt, this gives 

gtb = sinh~ 1 (gt). (13.34) 

The time on the front clock changes according to dtf = dt/jf, so eq. (13.32) gives 
_ f* dt /'* g 3 t 2 Ldt 

Jo v/l — g 2 t 2 Jo 1 


tf = 


(13.35) 


VtfW ' J 0 (1 + ^2)3/2 • 

The integral 13 of x 2 /(l + a ; 2 ) 3 / 2 is sinh -1 x — x/\/l + x 2 . Letting x = gt, this gives 

gt 


gt f = sinh \gt) + (gL) sinh 1 (gt) 


(13.36) 


/1 + g 2 t 2 j 

Using eqs. (13.29), (13.34), and (13.29), we may rewrite this as 

gtf = gtb(l + gL) — gLv. (13.37) 

Dividing by g, and putting the c’s back in to make the units correct, we finally have 

= + < 13 - 38 > 

as we wanted to show. 


Remark: Looked at from the reverse point of view, this calculation, which uses only special- 
relativity concepts, demonstrates that someone at the back of a rocket sees a clock at the 
front running fast by a factor (1 + gL/c 2 ). There are, however, far easier ways of deriving 
this, as we saw in Section 13.2 and in Problem 10.24 (“Acceleration and redshift”). A 

12 To derive this, make the substitution x = sinh#. 

13 Again, to derive this, make the substitution x = sinh 6. 
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6. Accelerator’s point of view 


(a) First Solution: Eq. (13.5) says that the distance traveled by the rocket (as 
measured in the original inertial frame), as a function of the time in the inertial 
frame, is 

d= X - (yi + (<?*) 2 - l) • (13.39) 

An inertial observer on the planet therefore measures the rocket-planet distance 
to be 

x = f-i(v / TTW-l )• (13.40) 

The rocket observer will see this leng th being c ontracted by a factor 7 . Using the 
result of Exercise 5, we have 7 = i/l + ( gt ) 2 = cosher). So the rocket-planet 
distance, as measured in the instantaneous inertial frame of the rocket, is 


1- 1 (cosh(gr) - 1 ) 
cosh(gr) 


1 +gx = 


1 + gt 

cosh (gr) ’ 


(13.41) 


as desired. 

Second Solution: Eq. (13.18) gives the speed of the planet in the accelerating 
frame of the rocket. Using the results of Exercise 5 to write v in terms of r, we 
have (with c = 1 ) 

^ = -(1 + gx) tanh(firr). (13.42) 

dr 

Separating variables and integrating gives 

J \ — = ~ J tan Mfi ,r ) => ln(l + gx) = — In (cosher)) + C 

=> 1+9X= ^(g^- (13 ‘ 43) 

Since the initial condition is x = I when r = 0, we must have A = 1 + gt, which 
gives eq. (13.20), as desired. 

(b) Eq. (13.17) says that the planet’s clock runs fast (or slow) according to 


dt = dr (1 + gx)\J 1 — v 2 . (13.44) 

The results of Exercise 5 yield \J 1 — v' 2 = 1 /cosh(gr). Combining this with the 
result for 1 + gx above, and integrating, gives 

f dt = f ( 1 + 9 £ ) => gt = (1 + gt) tanh(flrr), (13.45) 

J J cosh (gr ) 

as desired. 


7. Lv/c 2 revisited 


Consider first the case where the rocket accelerates while you sit there. Problem 5 is 
exactly relevant here, and it tells us that in your frame the clock readings are related 
by 


tf — tb 



Lv 


(13.46) 
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You will eventually see the front clock an arbitrarily large time ahead of the back 
clock. Note that for small times (before things become relativistic), the standard 
Newtonian result, v sfc gtf,, is valid, so we have 

'/»(*»+$) ~ L i = <*• (13.47) 

We see that in the setup where the rocket is the one that accelerates, both clocks 
show essentially the same time near the start (the leading term in the time difference 
is of order ( v/c ) 2 ). This makes sense; both clocks have essentially the same speed at 
the beginning, so to lowest order their 7 factors are the same, so the clocks run at the 
same rate. But eventually the front clock will get ahead of the back clock. 

Now consider the case where you accelerate while the rocket sits there. Problem 6 is 
relevant here, if we let the rocket in that problem now become you, and if we let two 
planets a distance L apart become the two ends of the rocket. The times you observe 
on the front and back clocks on the rocket are then, using eq. (13.45) and assuming 
that you are accelerating toward the rocket, 

gtf = (1 + gl) tanh(gr), and gtt, = (l + g{£ + L)) tanh(gr). (13.48) 

But from Exercise 5, we know that your speed relative to the rocket is v = tanh gr. 
Eq. (13.48) therefore gives tb = tf + Lv, or f b = f/ + Lv/c 2 with the c 2 . So in this 
case we arrive at the standard Lv/c 2 “head-start” result. 

The point here is that in this second case, the clocks are synchronized in the rocket 
frame, and this is the assumption that went into our derivation of the Lv/c 2 result in 
Chapter 10. In the first case above where the rocket accelerates, the clocks are not 
synchronized in the rocket frame (except right at the start), so it’s not surprising that 
we don’t obtain the Lv/c 1 result. 

8. Circling the earth 

This is one setup where we really need to use the correct term, stationary- proper-time 
principle. It turns out that B’s path yields a saddle point for the proper time. The 
value at this saddle point is less than A’s proper time, but this is irrelevant, because 
we only care about local extrema, not global ones. 

B’s path is a saddle point because there exist nearby paths that give both a larger 
and smaller proper time. 14 The proper time can be made smaller by having B speed 
up and slow down. This will cause a net increase in the time-dilation effect as viewed 
by A, thereby yielding a smaller proper time. 15 The proper time can be made larger 
by having B take a nearby path that doesn’t quite form a great circle on the earth. 
(Imagine the curve traced out by a rubber band that has just begun to slip away 
from a great-circle position.) This path is shorter, so B won’t have to travel as fast 
to get back in a given time, so the time-dilation effect will be smaller as viewed by A, 
thereby yielding a larger proper time. 

14 The differences are in fact second-order ones, because the first-order ones vanish due to the fact 
that the path satisfies the Euler-Lagrange equations for the Lagrangian in eq. (13.15). 

ls This is true for the same reason that a person who travels at constant speed in a straight line 
between two points will show a larger proper time than a second person who speeds up and slows 
down. This follows directly from SR time dilation, as viewed by the first person. If you want, 
you can imagine unrolling B’s circular orbit into a straight line, and then invoke the result just 
mentioned. As far as SR time-dilation effects from clock A’s point of view go, it doesn’t matter if 
the circle is unrolled into a straight line. 
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Figure 13.9 


9. Twin paradox 

(a) In the earth frame, the spaceship travels at speed v for essentially the whole time. 
Therefore, the traveler ages less by a fraction ^/l'—•ItA/e 2 « 1 — v 2 /2c 2 . The 
fractional loss of time is thus v 2 /2c 1 . The time-dilation effect will be different 
during the short turning-around period, but this is negligible. 

(b) Let the distance to the star be £, as measured in the frame of the earth (but the 
difference in lengths in the two frames is negligible in this problem), and let the 
turnaround take a time T. Then the given information says that T -C (21)/w. 
During the constant-speed p|f| i :of the trip, the traveler sees the earth clock 
running slow by a fraction ^/l — ij 2 /c 2 ss 1 — v 2 /2c 2 . The time for this constant- 
speed part is 2£/v, so the earth clock loses a time of (v 2 /2c 2 )(2£/v) = v£/(?. 
However, during the turnaround time, the spaceship is accelerating toward the 
earth, so the traveler sees the earth clock running fast, due to the GR time 
dilation. The magnitude of the acceleration is a = 2v/T, because the spaceship 
goes from velocity v to —v in time T. The earth clock therefore runs fast by a 
factor 1 + aI/(? = 1 + 2v£/Tc 2 . This happens for a time T, so the earth clock 
gains a time of (2 v£/Tc 2 )T = 2 v£/c 2 . 

Combining the results of the previous two paragraphs, we see that the earth clock 
gainsatimeof2uf/c 2 — vt/c? = v£/c 2 . This is a fraction {v£/c 2 )/(2£/v) =v 2 /2 c 2 
of the total time, in agreement with part (a). 


10. Twin paradox again 

(a) The only difference between this problem and the previous one is the nature of 
the turnaround, so all we need to show here is that the traveler still sees the 
earth clock gain a time of 2 v£/(? during the turnaround. 

Let the radius of the semicircle be r. Then the magnitude of the acceleration is 
a = v 2 /r. Let 9 be the angle shown in Fig. 13.9. For a given 9, the earth is at a 
height of essentially l cos 9 in the gravitational field felt by the spaceship. The 
fractional time that the earth gains while the traveler is at angle 9 is therefore 
ah/c 2 = (v 2 /r)(£cos9)/c 2 . Integrating this over the time of the turnaround, 
and using dt = rd9/v, we see that the earth gains a time of 

At = (!^) (^) = , (13.49) 

during the turnaround, as we wanted to show. 

(b) Let the acceleration vector at a given instant be a, and let £ be the vector from 
the spaceship to the earth. Note that since the turnaround is done in a small 
region of space, £ is essentially constant here. 

The earth is at a height of essentially a • £ in the gravitational field felt by the 
spaceship. (The dot product just gives the cosine term in the above solution to 
part (a).) The fractional time gain, ah/c 2 , is therefore equal to |a|(a • £)/c 2 = 
a ■£/<?. Integrating this over the time of the turnaround, we see that the earth 
gains a time of 


a £ , £ Tf 

t = l. dt ~ 3'i, ° 


“2 • ( V / - V 0 
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£.(2v f ) 


during the turnaround, as we wanted to show. The whole point here is that no 
matter what complicated motion the traveler undergoes during the turnaround, 
the total effect is to simply change the velocity from v outward to v inward. 

11. Twin paradox times 

(a) As viewed by A, the times of the twins are related by 

dt B = Vl-v 2 dt A - (13.51) 


(13.50) 


Assuming vo -C c, we may say that v(t A ) is essentially equal to vo — gt A , so the 
out and back parts of the trip each take a time of essentially vo/g in A’s frame. 
The total elapsed time on B’s clock is therefore 



& 2 jj' 9 (l-\{v 0 -gt) 2 ^jdt 


f | \ p/s 

_ 2u 0 .Vq 

9 3gc 2 ’ 

where we have put the c’s back in to make the units right, 
elapsed time to A’s is therefore 


(13.52) 
The ratio of -B’s 


Tb _ T b _ 1 _ _^o_ 
T a ~ 2 v 0 /g ~ 6c 2 ' 


(13.53) 


(b) As viewed by B, the relation between the twins’ times is given by eq. (13.13), 


dt A 



(13.54) 


Assuming vo c, we may say that v(t B ) is essentially equal to vo~gt B , and A’s 
height is essentially equal to vot B — gt%/2. The up and down parts of the trip 
each take a time of essentially v 0 /g in B’s frame. Therefore, the total elapsed 
time on A’s clock is (using the approximation in eq. (13.14), and dropping the 
c’s) 


T a 


/ 


dt A 


2 f - ^(A) - gt) 2 + g(v 0 t - 9 ^/ 2 )^ dt. 
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up to corrections of order v\/c 4 . This agrees with the result found in part (a), 
as the equivalence principle requires. 



Chapter 14 


Appendices 

Copyright 2004 by David Morin, morin@physics.harvard.edu 

14.1 Appendix A: Useful formulas 

14.1.1 Taylor series 


f(x 0 + e) = f(x 0 ) + f'{x o)e + ^ ^ e 2 + ^ ^ e 3 + • • • 

(14.1) 

— = l + x + x 2 + x 3 -\ - 

1 — x 

(14.2) 

1 = 1 + 2x + 3a? 2 + 4a? 3 -I- 

(1 - a?) 2 

(14.3) 

ln(l - a?) = -a? - y - y- 

(14.4) 

XI X 2 X 3 

e^ = 1 + x +¥ + - + ... 

(14.5) 

a; 2 a? 4 

cosx = l- ¥ + --... 

(14.6) 

X 3 X 5 

(14.7) 


(14.8) 

1 x 3a? 2 

_ 2 + ^ + "' 

(14.9) 

(1 + a?) n = 1 + nx + a; 2 + x 3 4- 

(14.10) 
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14.1.2 

Nice formulas 



e 10 = cos 9 + i sin 9 

(14.11) 


cos 0 = ± (e i0 + e~ i0 ) , sin 9 = ^ (e i0 - e“*®) 

(14.12) 


9 /l + costf . 0 /1 — cos 0 

2 V 2 2 V 2 

(14.13) 


0 /1 — cos 0 1 — cos 0 sin 0 

tan 2 y 1 + cos 9 sin 9 1 + cos 9 

(14.14) 


sin 29 = 2 sin 9 cos 9, cos 29 = cos 2 9 — sin 2 9 

(14.15) 


sin(a + (3) = sin a cos (3 + cos a sin /I 

(14.16) 


cos (a + fi) = cos a cos fi — sin a sin f3 

(14.17) 


. ,, tan a + tan 3 

tan (a + (3) = -' 

1 — tan a tan (3 

(14.18) 


coshx = ~(e x + e~ x ), sinhx = ~(e x — e~ x ) 

(14.19) 


cosh 2 x — sinh 2 x = 1 

(14.20) 


-f- cosh x = sinh x, -f- sinh x = cosh x 

ax ax 

(14.21) 

14.1.3 

Integrals 



J In x dx = x In x — x 

(14.22) 


f X ^ 

J 2 4 

(14.23) 


J xe x = e x (x — 1) 

(14.24) 


f dx , i 

/ -„ = tan x or — cot x 

J 1 + x 2 

(14.25) 
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I dx _1 { x 2 

J x(l + x 2 ) 2 n y 1 + x 


dx 


(\ + X 

1 — X 2 

Vl — x 

dx 

= 5>“l 

(x + V 

1 -x 2 

^x — 1, 


= sin x or — cos x 


x or — esc x 


f dx 

J TT^ 

/ , = ln(x + \Jx 2 + 1) or sinh -1 x 

Vx 2 + 1 

/ , = ln(x + Vx 2 — 1) or cosh -1 x 

Vx 2 - 1 

r dx _i 

/ — , „ = sec 

J xy/x 2 — 1 

f dx fl + s/l + x 2 \ 

J xy/l + x 2 "~ n { x J 

f dx (l + Vl ~ x 2 \ 

Jw7^ = -' n { - x -j 

f dx , /l + sinx\ 

f dx _ j /I — cosx\ 

J since V sinx J 


or — csch x 

or — sech - 1 x 


(14.26) 

(14.27) 

(14.28) 

(14.29) 

(14.30) 

(14.31) 

(14.32) 

(14.33) 

(14.34) 

(14.35) 

(14.36) 
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14.2 Appendix B: Units, dimensional analysis 

There are two strategies you should invoke without hesitation when solving a prob¬ 
lem. One is the consideration of units (that is, dimensions), which is the subject of 
this appendix. The other is the consideration of limiting cases, which is the subject 
of the next appendix. 

The consideration of units offers two main benefits. First, looking at units before 
you start a calculation can tell you roughly what the answer has to look like, up to 
numerical factors. (And in some problems, you can determine the numerical factors 
by considering a limiting case of a certain parameter. So in some problems, you 
actually don’t have to do any calculations!) Second, checking units at the end of a 
calculation (which is something you should always do) tells you if your answer has a 
chance at being correct. It won’t tell you that your answer is definitely correct, but 
it might tell you that your answer is definitely incorrect. If your goal in a problem 
is to find, say, a length, and if you end up with a mass, then you know it’s time to 
look over your work. 

“Your units are wrong!” cried the teacher. 

“Your church weighs six joules — what a feature! 

And the people inside 

Are four hours wide, 

And eight gauss away from the preacher!” 



Figure 14.1 


In practice, the second of the above two benefits is what you will generally make 
use of. But let’s do a few examples relating to the first benefit, since these can be 
a little more exciting. To solve the following problems exactly, we would need to 
invoke results derived in earlier chapters in the text. But let’s just see how far we can 
get by using only dimensional analysis. We’ll use the “[ ]” notation for units, and 
we’ll let M stand for mass, L for length, and T for time. For example, we will write 
a speed as [u] = L/T and the gravitational constant as [G] = L 3 /(MT 2 ) (you can 
figure this out by noting that Gm\m 2 /r 2 has the dimensions of force). Alternatively, 
you can just use the mks units, kg, m, s, instead of M, L, T, respectively. 1 


Example 1 (The pendulum): A mass m hangs from a massless string of length £ 
(see Fig. 14.1) and swings back and forth in the plane of the paper. The acceleration 
due to gravity is g. What can we say about the frequency of oscillations? 

The only dimensionful quantities given in the problem are [to] = M, [l] = L, and 
[g] = L/T 2 . There is one more quantity, the maximum angle 9q , which is dimensionless 
(this one is easy to forget). Our goal is to find the frequency, which has units of 1/T. 
The only combination of our given dimensionful quantities that has units of 1/T is 


1 When you check units at the end of a calculation, you will invariably be working with the 
kg,m,s notation. So that notation will inevitably get used more. But I’ll use the M,L,T notation 
in this appendix, because I think it’s a little more instructive. At any rate, just remember that the 
letter m (or M) stands for “meter” in one case, and “mass” in the other. 
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\fgfl. 2 But we can’t rule out any 0 O dependence, so the most general possible form 
for the frequency (in radians per second) is 


^ = / ( ^ o) yf, (14-37) 

where / is a dimensionless function of the dimensionless variable 9o. 

Remarks: It just so happens that for small oscillations, f ( 0 o ) is essentially equal to 1, 
and so the frequency is essentially equal to a Jg/t ■ But there is no way to show this by 
using only dimensional analysis. For larger values of 0o, the higher-order terms in the 
expansion of / become important. Exercise 3.8 deals with the leading correction; the answer 
is f (0 0 ) = 1 - 91 /16 + • • 

Note that since there is only one mass scale in the problem, there is no way that the frequency 
(with units of 1/T) can depend on [m] = M. If it did, there would be nothing to cancel out 
the units of mass and produce a pure inverse-time. & 

What can we say about the total energy (relative to the lowest point) of the pendulum? 
Energy has units of MI 2 /! 2 , and the only combination of the given dimensionful 
constants of this form is mg£. Therefore, the energy must be of the form f{9g)mg£, 
where / is some function. That’s as far as we can go with dimensional analysis. Of 
course, if we actually invoke a little physics, we know that the total energy equals 
the potential energy at the highest point, which is rngl{\ — cos 0q). Using the Taylor 
expansion for cos 9, we see that f(0 o) = 9 %/2 — 9q /24 + • • •. Unlike in the frequency 
above, the maximum angle, 9q, plays a critical role in the energy. 


Example 2 (The spring): A spring with spring-constant k has a mass m on its 
end (see Fig. 14.2). The spring force is F(x) = —kx, where x is the displacement 
from the equilibrium. What can we say about the frequency of oscillations? 

The only dimensionful quantities in this problem are [m] = M, [k] = M/T 2 (obtained 
by noting that kx has the dimensions of force), and the maximum displacement from 
the equilibrium, [xo] = L. (There is also the equilibrium length, but the force doesn’t 
depend on this, so there is no way it can come into the answer.) Our goal is to find the 
frequency, which has units of 1/T. The only combination of our given dimensionful 
quantities with these units is 



where C is a dimensionless number. Note that, in contrast with the pendulum above, 
the frequency cannot have any dependence on the maximum displacement. It just 
so happens that C is equal to 1, but there is no way to show this by using only 
dimensional analysis. 

What can we say about the total energy of the spring? Energy has units of ML 2 /T 2 , 
and the only combination of the given dimensionful constants of this form is Bkxl, 


2 You can verify this by writing down a general product of the given quantities raised to arbitrary 
powers (that is, m a i b g c ), and writing out the units of this product in terms of a, b, and c. When 
you set the units equal to 1/T, you will obtain a system of equations in a,b,c, and you will find that 
the solution is a = 0, b = —1/2, and c = 1/2. In more complicated problems, this method may 
turn out to be necessary. But in most problems, you can quickly see what the correct combination 
of the given quantities is. 


g-g-&r , 


Figure 14 
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where B is a dimensionless number. It turns out that B = 1/2, so the total energy 
equals kx\/2 . 

Remark: A real spring doesn’t have a perfectly parabolic potential (that is, an exactly linear 
force), so the force actually looks something like F(x) = —kx + bx 2 + ■ ■ ■. If we truncate 
the series at the second term, then we have one more dimensionful quantity to work with, 
[6] = M/LT 2 . To form a quantity with the dimensions of frequency, 1/T, we need the xo 
and b to appear in the combination xob, because this is the only way to get rid of the L. 
You can then see (by using the strategy of writing out a general product of the variables, 
discussed in the above footnote) that the frequency must be of the form f(x 0 b/k)^/k/m. 
We therefore can have xo dependence in this case. Note that this answer must reduce to 
Cy/k/m, for 6 = 0. Hence, / must be of the form f(y) = C + ci y + ciy 2 H-. & 


Example 3 (Low-orbit satellite) : A satellite of mass m travels in a circular orbit 
just above the earth’s surface. What can we say about its speed? 

The only dimensionful quantities in the problem are [to] = M, [ 5 ] = L/T 2 , and the 
radius of the earth [f?] = L . 3 Our goal is to find the speed, which has units of L/T. 
The only combination of our dimensionful quantities with these units is 

v = C\fgR. (14.39) 


It turns out that C - 1 . 


14.2.1 Exercises 

1. Pendulum on Pluto 

If a pendulum has a period of 3 s on the earth, what would its period be if it 
were placed on the moon? Use g M /gE ~ 1/6. 


2. Earth and moon radii 

The value of g on the surface of a planet is given by 
GM 

9 = J?’ 


(14.40) 


where M and R are the mass and radius of the planet, respectively, and G is 
Newton’s gravitational constant. If the densities of the moon and the earth 
are related by p m /Pe = 3/5, and if g M /gE = 1/6, what is R M /R E 1 


3 You might argue that the mass of the earth, M e , and Newton’s gravitational constant, G, 
should be included here. But for a particle located at the surface of the earth, these quantities 
appear only in the gravitational force through the combination ( GM e /R 2 )m = mg. So we can 
absorb the effects of M e and G into g. 
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3. Escape velocity 

The escape velocity 4 on the surface of a planet is given by 



where M and R are the mass and radius of the planet, respectively, and G is 
Newton’s gravitational constant. 

(a) Write v in terms of the average mass density p, instead of M. 

(b) Assuming that the average density of the earth is four times that of 
Jupiter, and that the radius of Jupiter is 11 times that of the earth, what 
is Vj/v E ? 

4. Waves on a string 

How does the speed of waves on a string depends its mass M, length L, and 
tension (that is, force) T? 

5. Vibrating water drop 

Consider a vibrating water drop, whose frequency ( u ) depends on its radius 
( R ), mass density (p), and surface tension (S'). 5 How does v depend on R, p, 
and S'? 

14.2.2 Problems 

1. Escape velocity 

Show that the escape velocity from the earth is given by eq. (14.41), up to 
numerical factors. 

2. Mass in tube 

A tube of mass M and length l is free to swing by a pivot at one end. A mass 
m is positioned inside the tube at this end. The tube is held horizontal and 
then released (see Fig. 14.3). Let p be the fraction of the tube the mass has 
traversed by the time the tube becomes vertical. Does p depend on £? 

3. Waves in a fluid 

How does the speed of waves in a fluid depend on its density, p , and “Bulk 
Modulus,” B (which has units of pressure, which is force per area)? 

4. Vibrating star 

Consider a vibrating star, whose frequency (v) depends on its radius (R), mass 
density (p), and Newton’s gravitational constant (G). How does v depend on 
R , p, and G? 

4 This is the velocity needed to refute the “What goes up must come down” maxim (neglecting 
air resistance and such). 

®The units of surface tension are (Force)/(Length). 



Figure 14.3 
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5. Damping ** 

A particle with mass m and initial speed V is subject to a velocity-dependent 
damping force of the form bv n . 

(a) For 7) — 0 , 1,2f...., determine how the stopping time depends on m, V, 
and b. 

(b) For ft — 0,1,2,..., determine how the stopping distance depends on m, 
V, and b. 

Be careful! See if your answers make sense. Dimensional analysis only gives 
the answer up to a numerical factor. This is a tricky problem, so don’t let it 
discourage you from using dimensional analysis. Most applications of dimen¬ 
sional analysis are quite straightforward. 

14.2.3 Solutions 

1. Escape velocity 

The reasoning is the same as in the satellite example above. Therefore, the answer 
is v = Cy/gR = Cy/GM e /R, where we have used g = GM e /R 2 . It turns out that 
C=V2 . 

This quick solution is actually not quite rigorous, considering the footnote in the 
above satellite example. Since the particle is not always at the same radius, the force 
changes, so it isn’t obvious that we can absorb the M e and G dependence into one 
quantity, g, as we did with the orbiting satellite. Let us therefore be more rigorous 
with the following reasoning. 

The dimensionful quantities in the problem are [m] = M, the radius of the earth 
[R] = L, Newton’s gravitational constant [G] = L 3 /MT 2 , and the mass of the earth 
[Afe] = M. 

If we use no information other than these given quantities, then there is no way to 
arrive at the speed of C^GM e /R, because for all we know, there could be a factor 
of (m/M e ) 7 in the answer. This number is dimensionless, so it wouldn’t mess up the 
units. 

If we want to make any progress in this problem, we have to use the fact that the force 
takes the form of GM e m/r 2 . This then implies that the acceleration is independent 
of m. And since the path of the particle is determined by its acceleration, we see that 
our answer cannot depend on to. We are therefore left with the quantities G, R, and 
M, and yo u can sh ow that the only combination of these quantities that gives a speed 
is v = CyfGMjR. 

2. Mass in tube 

The dimensionful quantities are [5] = L/T 2 , [I] = L, [to] = M, and [M\ = M. We 
want to produce a dimensionless number 77. Since g is the only constant involving 
time, 77 cannot depend on g. This then implies that 77 cannot depend on l, the only 
length remaining. Therefore, 77 depends only on the ratio m/M. So the answer to the 
stated problem is, “No.” 

It turns out that you have to solve the problem numerically to find 77 (see Problem 
7.4). Some results are: If to M, then 77 « 0.349. If to = M, then 77 ss 0.378. And 
if to = 2M, then 77 ~ 0.410. 
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3. Waves in a fluid 


We want to make a speed, [i>] = L/T, out of the quantities [p] = M/L 3 , and [B] = 
[. F/A) = ( ML/T 2 )/(L 2 ) = M/(LT 2 ). We can play around with these quantities to 
find the combination that has the correct units, but let’s do it the no-fail way. If 
v = p a B b , then we have 


L _ (M\ a (My 
\Lfi) ' 


(14.42) 


Matching up the powers of the three kinds of units on each side of this equation gives 


M : 0 = a + b, L : 1 = —3a — b, T : -1 =-2 b. (14.43) 


The solution to t his sy stem of equations is a = —1/2 and b = 1/2. Therefore, our 
answer is v oc \JB/p. Fortunately, there was a solution to this system of three 
equations in two unknowns. 


4. Vibrating star 


We want to make a frequency, [v\ = 1/T, out of the quantities [/?] = L, [p] = M/L 3 , 
and [G] = L 3 /(MT 2 ). These units for G follow from the gravitational force law, 
F = Gmim 2 /r 2 . We can play around with these quantities to find the combination 
that has the correct units, but let’s do it the no-fail way. If v = R a p b G c , then we 
have 


T 


L a 



(14.44) 


Matching up the powers of the three kinds of units on each side of this equation gives 


M : 0 = 6 — c, L : 0 = a — 36 + 3c, T : -1 = -2c. (14.45) 

The solution to this system of three equations is a = 0, and b = c = 1/2. Therefore, 
our answer is v oc sfpG- 


Remark: Note the difference in the given quantities in this problem (R , p, and G) and the 
ones in Exercise 5 ( R , p, and S). In this problem with the star, the mass is large enough 
so that we can ignore the surface tension, S. And in Exercise 5 with the drop, the mass is 
small enough so that we can ignore the gravitational force, and hence G. Jb 

5. Damping 

(a) The constant b has units [b] = [Force][v - ”] = ( ML/T 2 ){T n /L n ). The other 
constants are [to] = M and [V] = L/T. There is also n, which is dimensionless. 
You can show that the only combination of these constants that has units of T 
is 

t = f(n)^ = r, (14.46) 

where f(n) is a dimensionless function of n. 

For n = 0, we have t = f(())mV/b. This increases with m and V, and decreases 
with b, as it should. 

For n = 1 we have t = f(l')rn/b. So we seem to have t ~ m/b. This, however, 
cannot be correct, because t should definitely grow with V. A large initial speed 
V\ requires some non-zero time to slow down to a smaller speed 14, after which 
point we simply have the same scenario with initial speed V2- Where did we 
go wrong? After all, dimensional analysis tells us that the answer does have to 
look like t = f(l)m/b, where /(1) is a numerical factor. 
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The resolution to this puzzle is that /(1) is infinite. If we worked out the problem 
using F = ma, we would encounter an integral that diverges. So for any V, we 
would find an infinite f. 6 

Similarly, for n > 2, there is at least one power of V in the denominator of t. 
This certainly cannot be correct, because t should not decrease with V. So f(n) 
must likewise be infinite for all of these cases. 

The moral of this exercise is that you have to be careful when using dimensional 
analysis. The numerical factor in front of your answer nearly always turns out 
to be of order 1, but in some strange cases it turns out to be 0 or oo. 

Remark: For n > 1, the expression in eq. (14.46) still has relevance. For example, 
for n = 2, the m/(Vb) expression is relevant if you want to know how long it takes to 
go from V to some final speed Vf. The answer involves m/(Vfb), which diverges as 
V f ^0.* 

(b) You can show that the only combination of these constants that has units of L 
l = 9(n) (14.47) 

where g(n) is a dimensionless function of n. 

For n = 0, we have £ = g(0)mV 2 /b. This increases with V, as it should. 

For n = 1, we have £ = g(l)rnV/b. This increases with V, as it should. 

For n = 2 we have £ = g(2)m/b. So we seem to have £ ~ m/b. But as in part (a), 
this cannot be correct, because £ should definitely depend on V. A large initial 
speed Vi requires some non-zero distance to slow down to a smaller speed V2, 
after which point we simply have the same scenario with initial speed Yi- So, 
from the reasoning in part (a), the total distance is infinite for n > 2, because 
the function g is infinite. 

Remark: Note that for »/l, ( and l are either both finite or both infinite. For 
n SS’||, however, the total time is infinite, whereas the total distance is finite. This 
situation actually holds for 1 < n < 2, if we want to consider fractional n. Jb 


6 The total time t is actually undefined, because the particle never comes to rest. But t does 
grow with V, in the sense that if t is defined to be the time to attain some given small speed, then 
t grows with V. 
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14.3 Appendix C: Approximations, limiting cases 

Along with checking units, checking limiting cases (or special cases) is something 
you should always do at the end of a calculation. As in the case with checking 
dimensions, this won’t tell you that your answer is definitely correct, but it might tell 
you that your answer is definitely incorrect. It is generally true that your intuition 
about limiting cases is much better than your intuition about generic values of the 
parameters. You should use this fact to your advantage. 

A main ingredient in checking limiting cases is the Taylor series approximations. 
The series for many functions are given in Appendix A. 

The examples presented below have been taken from various problems through¬ 
out the book. For the most part, we’ll just repeat here what we’ve already said in 
the remarks given earlier in the text. 


Example 1 (Dropped ball) : A beach-ball is dropped from rest at height h. Assume 
that the drag force from the air is F ( j = —mav. We found in Section 2.3 that the 
ball’s speed and position are given by 

v(t) = — ^ (l — e ~ at ), and y(t) = h — ^(}~ e_a *)) ' (14.48) 


Let’s look at some limiting cases. If t is very small (more precisely, if at -C 1; see the 
remark following this example), then we can use the Taylor series, e~ x « 1 — x + x 2 /2, 
to make approximations to leading order in at. The v(t) in eq. (14.48) becomes 


v(t) 


►¥-■)) 


(14.49) 


plus terms of higher order in at. This answer is expected, because the drag force is 
negligible at the start, so we essentially have a freely falling body. Eq. (14.48) also 
gives 


y(t) 


(at)* 


h- 9 t 


(14.50) 


plus terms of higher order in at. Again, this answer is expected, because we essentially 
have a freely falling body. 

We may also look at large t (or rather, large at). In this case, e~ at is essentially zero, 
so the v{t) in eq. (14.48) becomes 


v(t) w -- 


(14.51) 


This is the “terminal velocity”. Its value makes sense, because it is the velocity for 
which the total force, —mg — mav, vanishes. Eq. (14.48) also gives 


y(t) ~ h - - . 


(14.52) 
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Apparently, for large t, g/a 2 is the distance our ball lags behind another ball which 
started out already at the terminal speed, g/a. 


Whenever you derive approximate answers as we did above, you gain something 
and you lose something. You lose some truth, of course, because your new answer is 
technically not correct. But you gain some aesthetics. Your new answer is invariably 
much cleaner (sometimes involving only one term), and this makes it a lot easier to 
see what’s going on. 

Remark: In the above example, it makes no sense to look at the limit where t is small 
or large, because t has dimensions. Is a year a large or small time? How about a hundredth 
of a second? There is no way to answer this without knowing what problem you’re dealing 
with. A year is short on the time scale of galactic evolution, but a hundredth of a second is 
long on the time scale of a nuclear process. 

It only makes sense to look at the limit of a small (or large) dimensionless quantity. In 
the above example, this quantity was at. The given constant a had units of T _1 , and so 
1/a set a typical time scale for the system. It therefore made sense to look at the limit 
where t < 1/a (that is, at <C 1), or likewise 1 > 1/a (that is, at 1). In the limit of a 
small dimensionless quantity, a Taylor series can be used to expand an answer in powers of 
the small quantity, as we did above. 

We sometimes get sloppy and say things like, “In the limit of small i.” But you know 
that we really mean, “In the limit of some small dimensionless quantity that has a t in the 
numerator,” or, “In the limit where t is much smaller that a certain quantity that has the 
dimensions of time.” £ 

The results of checking limits generally fall into two categories. Most of the time 
you know what the result should be, so this provides a double-check on your answer. 
But sometimes an interesting limit pops up that you might not expect. Such is the 
case in the following examples. 


Figure 14.4 

Example 2 (Two masses in 1-D): A mass m with speed v approaches a stationary 
mass M (see Fig. 14.4). The masses bounce off each other elastically. Assume all 
motion takes place in one dimension. We found in Section 4.6.1 that the final speeds 
of the particles are 


(m — M)v 
m + M 


, 2 mv 

and = M- 


(14.53) 


There are three special cases that beg to be checked: 

• If m = M, then eq. (14.53) tells us that to stops, and M picks up a speed of 
v. This is fairly believable. And it becomes quite obvious once you realize that 
these final speeds clearly satisfy conservation of energy and momentum with the 
initial conditions. 

• If M > m, then to bounces backward with speed w v, and M hardly moves. 
This is clear, because M is basically a brick wall. 
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• Ifm> M, then to keeps plowing along at speed « v, and M picks up a speed 
of « 2v. This 2v is an unexpected and interesting result (it becomes clearer if 
you consider what is happening in the reference frame of the heavy mass to), 
and it leads to some neat effects, as in Problem 4.22. 


Example 3 (Circular pendulum): A mass hangs from a massless string of length 
I. Conditions have been set up so that the mass swings around in a horizontal circle, 
with the string making an angle 9 with the vertical (see Fig. 14.5). We found in 
Section 2.5 that the angular frequency, u>, of this motion is 


(14.54) 


As far as 9 is concerned, there are two limits we should definitely check: 


• If 9 —> 90°, then u> —> oo. This makes sense; the mass has to spin very quickly 
to avoid flopping down. 

• If 9 —* 0, then u> —► \]g/t- This is the same as the frequency of a plane pendulum 
of length I (for small oscillations). You can convince yourself why this is true. 
Hint: look at the projection of the force on a given horizontal line. 



Figure 14.5 


In the above examples, we checked limiting and special cases of answers that 
were correct (I hope). This whole process is more useful (and a bit more fun) 
when you check the limits of an answer that is incorrect. In this case, you gain 
the unequivocal information that your answer is wrong. But rather than leading 
you into despair, this information is actually something you should be quite happy 
about, considering that the alternative is to carry on in a state of blissful ignorance. 
Personally, if there’s any way I’d want to discover that my answer is garbage, this 
is it. At any rate, checking limiting and special cases can often save you a lot of 
trouble in the long run... 

The lemmings get set for their race. 

With one step and two steps they pace. 

They take three and four, 

And then head on for more, 

Without checking the limiting case. 

14.3.1 Exercise 

1. Atwood’s machine * 

Consider the “Atwood’s” machine shown in Fig. 14.6, consisting of three 
masses and three frictionless pulleys. It can be shown that the acceleration of 
mi is given by (just accept this): 

3 m 2 m 3 - mi(4m 3 + m 2 ) t s 

°i = 9 -- r > (14.55) 

m 2 m 3 + mi(4m 3 + ra 2 ) 

with upward taken to be positive. Find oi in the following special cases: 



Figure 14.6 


i-Q 
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(a) m2 = 2mi = 2m3. 

(b) mi much larger than both m2 and m3. 

(c) mi much smaller than both m2 and m3. 

(d) m2 > mi = m3 

(e) mi = m 2 = m 3 . 
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14.4 Appendix D: Solving differential equations numer¬ 
ically 

Sooner or later you will encounter a differential equation that you cannot solve ex¬ 
actly. Having resigned yourself to not getting the exact answer, you should ponder 
how to obtain a decent approximation to it. In this marvellously advanced techno¬ 
logical era (which your children will dismiss with nothing more than a bewildered 
chuckle), it’s easy to write a short program that will give you a very good numerical 
answer to your problem. Given enough computer time, you can obtain any desired 
accuracy (assuming the system isn’t chaotic, but no need to worry about that for 
the systems we’ll be dealing with). We’ll demonstrate the procedure by using an 
easy problem, one that we actually do know the answer to. 

Consider the equation, 

x = —u 2 x. (14.56) 

This is of course the equation for a mass on a spring (with uj = y/k/m), and we 
know that the solution can be written, among other ways, in the form, 

x(t) = A cos (cut + (fi). (14.57) 

But let’s pretend we don’t know this. If someone comes along and gives us values 
for x'( 0 ) and i;( 0 ), it seems that somehow we should be able to find x(t) and x(t) 
for any later t, just by using eq. (14.56). Here’s how we do it. 

The plan is to discretize time into intervals of some small unit (call it e), and 
then to determine what happens at each successive point in time. If we know x(t ) 
and x(t ), then we can easily find (approximately) the value of x at a slightly later 
time, by using the definition of x. Similarly, if we know x{t) and x(t). then we 
can easily find (approximately) the value of x at a slightly later time, by using the 
definition of x. Using the definitions of the derivatives, the relations are simply 

x(t + e) & x(t)+ex(t), 

x(t + e) « x(t) + ex(t). (14.58) 

These two equations, combined with (14.56), which gives us x in terms of x. allow 
us to march along in time, obtaining successive values for x, x, and x. 7 

Here’s what a typical program might look like. (This is Maple, but even if you 
aren’t familiar with this, the general idea should be clear.) Let’s say that the particle 
starts from rest at position x = 2, and let’s pick w 2 = 5. We’ll use the notation 
where xl stands for x, and x2 stands for x. And e stands for e. Let’s calculate x at 
t = 3. 

7 Of course, another expression for x is the definitional one, analogous to eqs. (14.58), involving 
the third derivative. But this would then require knowledge of the third derivative, and so on with 
higher derivatives, and we would end up with an infinite chain of relations. An equation of motion 
such as eq. (14.56) (which in general could be an F = ma, r = 7a, or Euler-Lagrange equation) 
relates x back to x (and possibly x), thereby creating an intertwined relation among x , x, and x. 
and eliminating the need for an infinite and useless chain. 
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x:=2: # initial position 

xl:=0: # initial speed 

e:=.01: # a small time interval 

for i to 300 do # do 300 steps (ie, up to 3 seconds) 
x2:=-5*x: # the given equation 

x:=x+e*xl: # how x changes, by definition of xl 

xl:=xl+e*x2: # how xl changes, by definition of x2 

end do: # the Maple command to stop the do loop 

x; # print the value of x 


This procedure of course won’t give the exact value for x, because x and x don’t 
really change according to eqs. (14.58). These equations are just first-order approx¬ 
imations to the full Taylor series with higher-order terms. Said differently, there is 
no way the above procedure can be exactly correct, because there are ambiguities 
in how the program can be written. Should line 5 come before or after line 7? That 
is, in determining x at time t + e, should you use the x at time t or t + e? And 
should line 7 come before or after line 6? The point is that for very small e, the 
order doesn’t matter much. And in the limit e —> 0, the order doesn’t matter at all. 

If we want to obtain a better approximation, we can just shorten e down to .001 
and increase the number of steps to 3000. If the result looks basically the same as 
with e = .01, then we know we pretty much have the right answer. 

In the present example, e = .01 yields x 1.965 after 3 seconds. If we set 
e = .001, then we obtain x & 1.836. And if we set e = .0001, then we get x fw 1.823. 
The correct answer must therefore be somewhere around x = 1.82. And indeed, if 
we solve the problem exactly, we would obtain x(t) = 2cos(v / 5f)- Plugging in t = 3 
gives x & 1.822. 

This is a wonderful procedure, but it shouldn’t be abused. It’s nice to know 
that we can always obtain a decent numerical approximation if all else fails. But we 
should set our initial goal on obtaining the correct algebraic expression, because this 
allows us to see the overall behavior of the system. And besides, nothing beats the 
truth. People tend to rely a bit too much on computers and calculators nowadays, 
without pausing to think about what is actually going on in a problem. 

The skill to do math on a page 
Has declined to the point of outrage. 

Equations quadratica 
Are solved on Math’matica, 

And on birthdays we don’t know our age. 
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14.5 Appendix E: F = ma vs. F = dp/dt 

In nonrelativistic mechanics, 8 the equations F = ma and F = dp/dt say exactly the 
same thing, provided that m is constant. But if m in not constant, then dp/dt = 
d(mv)/dt = ma + ( dm/dt)v , which does not equal ma. In this case, should we use 
F = ma or F = dp/dt? Which law correctly describes the physics? The answer to 
this depends on what you label as the “system” to which you associate the quantities 
m, p. and a. You can generally do a problem using either F = ma or F = dp/dt , 
but you must be very careful about how you label things and how you treat them. 
The subtleties are best understood through two examples. 


Example 1 (Sand dropping into cart): Consider a cart into which sand is 
dropped (vertically) at a rate dm/dt = a. With what force must you push on the 
cart to keep it moving (horizontally) at a constant speed v? 

First solution: Let m(t) be the mass of the cart-plus-sand-inside system (label this 
system as “C”). If we use F = ma (where a is the acceleration of the cart, which 
is zero), then we obtain F = 0, which is incorrect. The correct expression to use is 
F = dp/ dt. This gives 


„ dp dm „ __. 

F = — = ma H—— v = 0 + av. (14.59) 

dt dt 

This makes sense, because your force is what increases the momentum of C, and this 
momentum increases simply because the mass of C increases. 

Second solution: It is possible to solve this problem by using F = ma, provided 
that you let your system be a small piece of mass that is being added to the cart. 
Your force is what accelerates this mass from rest to speed v. Consider a mass A to 
that falls into the cart during a time At. Imagine that it falls into the cart in one 
lump at the start of the At, and then accelerates (via friction) up to speed v after 
time At (and then this process repeats during each successive At interval). Then 
F = ma = Am(v/At). Writing this as ( Am/At)v gives the av result in the first 
solution. 


Example 2 (Sand leaking from cart): Consider a cart that leaks sand out of 
the bottom at a rate dm/dt = a. If you apply a force F to the cart, what is its 
acceleration? 


Solution: Let m(t) be the mass of the cart-plus-sand-inside system (label this system 
as “(7”). In this example, we want to use F = mo. So the acceleration is simply 


(14.60) 


Note that since m decreases with time, a increases with time. 

8 We won’t bother with relativity in this Appendix, because nonrelativistic mechanics contains 
all the critical aspects we want to address. 
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We used F = ma because at any instant, the mass m is what is being accelerated by 
the force F. If you want, you can imagine the process occurring in discrete steps: The 
force pushes on the mass for a short period of time, then a little piece instantaneously 
leaks out; then the force pushes again on the new mass, then another little piece leaks 
out; and so on. In this discretized scenario, it is clear that F = ma is the appropriate 
formula, because it holds for each step in the process. The only ambiguity is whether 
to use m or m + dm at a certain time, but this yields a negligible error. 

Remarks: It is still true that F = dp/dt in this problem, provided that you let F be total 
force, and let p be the total momentum. In this problem, F is the only force. However, the 
total momentum consists of both the sand in the cart and the sand that has leaked out and 
is falling through the air. * * * * * * * * 9 A common mistake is to use F = dp/dt, with p being only the 
system C”s momentum. 

There is an simple example that demonstrates why F = dp/dt doesn’t work when p refers 
only to C. Imagine that F = 0, and let the cart move with speed v. Cut the cart in half, 
and label the back part as the “leaked sand”, and the front part as the “cart”. If you want 
the cart’s p to have dp/dt = F = 0, then the cart’s speed must double if its mass gets cut 
in half. But this is nonsense. Both halves of the cart simply continue to move at the same 
rate, ft 


To sum up, F = dp/dt is always valid, provided that you use the total force and 

total momentum of a given system of particles. This approach, however, can get 

messy in certain situations. So in some cases it is easier to use an F = ma argument, 

but you must be careful to correctly identify the system that is being accelerated by 

the force. The asymmetry in the above two examples is that in the first example, 

the force does indeed accelerate the incoming sand; whereas in the second example, 

the force does not accelerate (or decelerate) the outgoing sand. F has nothing to do 

with the leaked sand. 


9 If there were air resistance, we would have to worry about its effect on the falling sand if we 
wanted to use F = dp/dt to solve the problem, where p is the total momentum. This is clearly not 
the best way to do the problem. If complicated things happen with the sand in the air, it would 
be foolish to consider this part of the sand if we don’t have to. 
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14.6 Appendix F: Existence of principal axes 

In this Appendix, we will prove Theorem 8.4. That is, we will show that an or¬ 
thonormal set of principal axes does indeed exist for any object, and for any choice 
of origin. It is not crucial that you study this proof. If you want to simply accept 
the fact that principal axes exist, that’s perfectly fine. But the method we’ll use 
in this proof is one you’ll see again and again in your physics career, in particular 
when you study quantum mechanics (see the remark following the proof). 

Theorem 14.1 Given a real, symmetric 3x3 matrix, I, there exist three orthonor¬ 
mal real vectors, a)*,, and three real numbers, Iwith the property that 

Lb*, = 4 u} k . (14.61) 


Proof: This theorem holds more generally with 3 replaced by N (all the steps 
below easily generalize), but we’ll work with N = 3, to be concrete. 

Consider a general matrix, I (we don’t need to assume yet that it’s real or 
symmetric). Assume that Iu = Iu for some vector u and some number I. 10 This 
may be rewritten as 



In order for there to be a nontrivial solution for the vector u (that is, one where u 4 
(0,0,0)), the determinant of this matrix must be zero . 11 Taking the determinant, 
we see that we get an equation for I of the form 

al 3 + bl 2 + cl + d = 0. (14.63) 

The constants a, b, c, and d are functions of the matrix entries 4?, but we won’t 
need their precise form to prove this existence theorem. The only thing we need 
this equation for is to say that there do exist three (generally complex) solutions for 
I, because the equation is of third degree. 

We will now show that the solutions for I are real. This will imply that there 
exist three real vectors u satisfying Iu = Iu, because we can plug the real Ts back 
into eq. (14.62) and solve for the real components u x , u y , and u z , up to an overall 
constant. We will then show that these vectors are orthogonal. 

• Proof that the I’s are real: This follows from the real and symmetric conditions 
on I. Start with the equation Iu = Iu, and take the dot product with u* to 
obtain 


u* • Iu = u* • Iu 

= Iu* • u. (14.64) 

10 Such a vector u is called an eigenvector of I, and I is the associated eigenvalue. But don’t let 
these names scare you. They’re just definitions. 

11 If the determinant were not zero, then we could explicitly construct the inverse of the matrix, 
which involves cofactors divided by the determinant. Multiplying both sides by this inverse would 
show that u = 0. 
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The vector u* is the vector obtained by simply complex conjugating each 
component of u (we don’t know yet that u can be chosen to be real). On the 
right side, I is a scalar, so we can take it out from between the u* and u. 

The fact that I is real implies that if we complex conjugate the equation 
Iu = lu, we obtain Iu* = I*u* (we know that I is real, but we don’t know 
yet that I is real). If we then take the dot product of this equation with u, 
we obtain 

u • Iu* = /*u • u*. (14.65) 

We now claim that if I is symmetric, then a • lb = b • la, for any vectors a 
and b. (We’ll leave this for you to show by simply multiplying each side out.) 
In particular, u* • Iu = u • Iu*, so eqs. (14.64) and (14.65) give 

(I- /*)u • u* 0. (14.66) 

Since u • u* = |«i| 2 + |tt 2 | 2 + \u^\ 2 ^ 0, we must have I = I*. Therefore, I is 
real. 

• Proof that the u are orthogonal: This follows from the symmetric condition 
on I. Let Iui = IiUi, and Iu 2 = I 2 u 2 . Take the dot product of the former 
equation with u 2 to obtain 


u 2 • Iui = /iu 2 • m, (14.67) 

and take the dot product of the latter equation with u 2 to obtain 


ui • Iu 2 = I 2 ui • u 2 . (14.68) 

As above, the symmetric condition on I implies that the left-hand sides of eqs. 
(14.67) and (14.68) are equal. Therefore, 

(It - I 2 )ui • u 2 = 0. (14.69) 

There are two possibilities here: (1) If I\ ^ I 2 , then we are done, because 
ui • u 2 — 0, which says that u 2 and u 2 are orthogonal. (2) If /] — / 2 = I, 
then we have I(aui + 6u 2 ) = I(aui + feu 2 ), for any a and b. So any linear 
combination of ui and u 2 has the same property that ui and u 2 have (namely, 
that applying I is the same as simply multiplying by I). We therefore have a 
whole plane of such vectors, so we can pick any two orthogonal vectors in this 
plane to be called Ui and u 2 . ■ 

This theorem proves the existence of principal axes, because the inertia tensor in 
eq. ( 8 . 8 ) is indeed a real and symmetric matrix. 

Remark: (Warning: This remark has nothing to do with classical mechanics. It is 
simply an ill-disguised excuse to write down another limerick.) In Quantum Mechanics, 
it turns out that any observable quantity, such as position, energy, momentum, angular 
momentum, etc., can be represented by a Hermitian matrix, with the observed value being 
an eigenvalue of the matrix. A Hermitian matrix is a (generally complex) matrix with the 
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property that the transpose of the matrix equals the complex conjugate of the matrix. For 
example, a 2 x 2 Hermitian matrix must be of the form, 



(14.70) 


for real numbers a, b, c, and d. Now, if observed values are to be given by the eigenvalues 
of such a matrix, then the eigenvalues had better be real, because no one (in this world, at 
least) is about to go for a jog of 4+3 i miles, or pay an electric bill for 17—43* kilowatt-hours. 
And indeed, you can show via a slightly modified version of the above “Proof that the I are 
real” procedure that the eigenvalues of any Hermitian matrix are real. (And likewise, the 
eigenvectors are orthogonal.) This is, to say the least, very fortunate. 

God’s first tries were hardly ideal, 

For complex worlds have no appeal. 

So in the present edition, 

He made things Hermitian, 

And this world, it seems, is quite real, ft 
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14.7 Appendix G: Diagonalizing matrices 


y 


This appendix is relevant to Section 8.3, which covers principal axes. The process 
of diagonalizing matrices (that is, finding the eigenvectors and eigenvalues ) has 
applications in countless types of problems in a wide variety of subjects. We will 
describe the process here as it applies to principle axes and moments of inertia. 

Let’s find the three principal axes and moments of inertia for a square with side 
length a, mass m, and one corner at the origin. The square lies in the x-y plane, 
with sides along the x- and y -axes (see Fig. 14.7). 

We’ll choose the given x- y- and 2 -axes as our initial basis axes. From eq. ( 8 . 8 ), 
the matrix I (with respect to this initial basis) is easily shown to be 



( fy 2 

-Jxy 

0 \ / 

' 1/3 

-1/4 

0 \ 

Figure 14.7 

I = P ~fxy 

Jx 2 

0 = ma 2 

-1/4 

1/3 

0 


V 0 

0 

J(x 2 + y 2 ) y ' 

V 0 

0 

2/3 ) 


where p is the mass per unit area, so that a 2 p = m. We have used the fact that 
2 = 0 , and we have not bothered to write the dxdy in the integrals. 

Our goal is to find the basis in which I is diagonal. That is, we want to find 
three solutions 12 for u (and I) in the equation Iu = /u. Letting I = Xrna 2 (to make 
things look a little cleaner), and using the above explicit form of I, the equation 
(I — /)u = 0 becomes 

/ 1/3 - A -1/4 0 \(u x \ (0\ 

ma 2 -1/4 1/3-A 0 \ \ u y = 0 . (14.72) 

V 0 0 2/3 -A ) \ u z j \o ) 

In order for there to be a nonzero solution for the components u x ,u y ,u z , the deter¬ 
minant of this matrix must be zero. The resulting cubic equation for A is easy to 
solve, because the determinant is simply [(1/3 — A ) 2 — (l/4) 2 ](2/3 — A) = 0. The so¬ 
lutions are A = 1/3 ± 1/4, and A = 2/3. So our three moments of inertia, I = A ma 2 , 

h = j,^ma 2 , I 2 = r^ma 2 , / 3 = ^ ma 2 . (14.73) 

These are the eigenvalues of I. 

What are the vectors, Up U 2 , and u 3 , associated with each of these /’s? Plugging 
A = 7/12 into (14.72) gives the three equations (one for each component), — u x — 
u y = 0, — u x — u y = 0, and u z = 0. These are redundant equations (that was the 
whole point of setting the determinant equal to zero). So u x = —u y , and u z = 0. 
The vector may therefore be written as ui = (c, —c,0), where c is any constant. 13 
If we want a normalized vector, then c = 1 /i/2- In a similar manner, plugging 
A = 1/12 into eq. (14.72) gives U 2 = (c, c,0). And finally, plugging A = 1/3 into eq. 

12 One obvious solution is u = z, because Iz = (2/3)ma 2 z. From the orthogonality result of 
Theorem 8.4, we know that the other two vectors must lie in the x-y plane. So we could quickly 
reduce this problem to a two-dimensional one, but let’s forge ahead with the general method. 

13 We can only solve for u up to an overall constant, because if Iu = Iu is true for a certain u, 
then it is also true that I(cu) =?'J(ru). where c is any constant. 
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(14.72) gives U 3 = (0,0, c), as claimed in the above footnote. Our three orthonormal 
principal axes corresponding to the moments in eq. (14.73) are therefore 

= i ’ 3 = (0 ' 0 ' 1) ' < 14 - 74) 

These are the eigenvectors of I. These axes are shown in Fig. 14.8. In the new 
basis of the principal axes, the matrix I takes the form 

/ 7/12 0 0 \ 

I = ma 2 0 1/12 0 . (14.75) 

\ 0 0 2/3 ) 

The basic idea is that from now on we should use the principal axes as our basis 
vectors. We can forget we ever had anything to do with the original x-y-z- axes. 

Remarks: (1) R + I 2 = 7 3 , as the perpendicular axis theorem demands. (2) I 2 is 
the moment around one diagonal through the center of the square, which of course equals 
the moment around the other diagonal through the center. But the latter is related to R 
by the parallel axis theorem; and indeed, R = I 2 + m{a/\/2) 2 . (3) Convince yourself why 
I 2 should equal the moment around the center of a stick of mass m and length a. (Any 
axis through the center of a square, in the plane of the square, has the same moment.) (4) 
An application of the parallel and perpendicular axis theorems gives (considering an axis 
through the center and parallel to the 7 3 axis) 7 3 = 27 2 + mRxjsJ 2) 2 . £ 


y 

/&2 


\<»1 

Figure 14.8 
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14.8 Appendix H: Qualitative relativity questions 

1. Is there such a thing as a perfectly rigid body? 

Answer: No. Since information can move no faster than the speed of light, it 
takes time for the atoms in the body to communicate with each other. If you 
push on one end of a rod, then the other end will not move right away. 

2. Moving clocks run slow. Does this result have anything to do with the time it 
takes light to travel from the clock to your eye? 

Answer: No. When we talk about how fast a clock is running in a given 
frame, we are referring to what the clock actually reads in that frame. It will of 
course take time for the light from the clock to reach an observer’s eye, but it is 
understood that the observer subtracts off this transit time in order to calculate 
the time at which the clock actually shows a particular reading. 

Likewise, other relativistic effects, such as length contraction and loss of simul¬ 
taneity, have nothing to do with the time it takes light to reach your eye. They 
deal only with what really is, in your frame. 

3. Does time dilation depend on whether a clock is moving across your vision or 
directly away from you? 

Answer: No. A moving clock runs slow, no matter which way it is moving. 

4. Does the special-relativistic time dilation depend on the acceleration of the 
moving clock? 

Answer: No. The time-dilation factor is 7 = 1/y 1 — v 2 /e 2 , which does not 
depend on a. The only relevant quantity is the v at a given instant. It doesn’t 
matter if v is changing. 

Of course, if you are accelerating, then you can’t naively apply the results of 
special relativity. (To do things correctly, it is perhaps easiest to think in terms 
of general relativity. But GR is actually not required; see Chapter 13 for a 
discussion of these issues.) But as long as you represent an inertial frame, then 
the clock you are viewing can undergo whatever motion it wants, and you will 
observe it running slow by the simple factor, 7 . 

5. Someone says, “A stick that is length-contracted isn’t really shorter, it just 
looks shorter.” Do you agree? 

Answer: Hopefully not. The stick really is shorter in your frame. Length 
contraction has nothing to do with how things look. It has to do with where 
the ends of the stick are at simultaneous times in your frame. (That is, after 
all, how you measure the length of something.) At a given instant in time (in 
your frame), the distance between the ends of the stick is indeed less than the 
proper length of the stick. 

6 . Consider a stick that moves in the direction in which it points. Does its length 
contraction depend on whether this direction is across your vision or directly 
away from you? 
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Answer: No. The stick is length-contracted in both cases. Of course, if you 
look at the stick in the latter case, then all you will see is the end, which will 
just be a dot. But the stick is indeed shorter in your reference frame. 

7. A mirror moves toward you at speed v. You shine a light towards it and the 
light beam bounces back at you. What is the speed of the reflected beam? 

Answer: The speed is c, as always. You will observe the light having a higher 
frequency, due to the Doppler effect. But the speed is still c. 

8. In relativity, the order of two events in one frame may be reversed in another 
frame. Does this imply that there exists a frame in which I get off a bus before 
I get on it? 


Answer: No. The order of two events can be reversed in another frame only 
if the events are spacelike separated. That is, if Ax > cAt (in other words, 
the events are too far apart for even light to get from one to the other). The 
two relevant events here (getting on the bus, and getting off the bus) are not 
spacelike separated, because the bus travels at a speed less than c, of course. 
They are timelike separated. Therefore, in all frames it is the case that I get off 
the bus after I get on it. 

There would be causality problems if there existed a frame in which I got off 
the bus before I got on it. If I break my ankle getting off a bus, then I wouldn’t 
be able to make the fast dash that I made to catch the bus is the first place, in 
which case I wouldn’t have the opportunity to break my ankle getting off the 
bus, in which case I could have made the fast dash to catch the bus and get on, 
and, well, you get the idea. 

9. You are in a spaceship sailing along in outer space. Is there any way you can 
measure your speed without looking outside? 

Answer: There are two points to be made here. First, the question is mean¬ 
ingless, because absolute speed does not exist. The spaceship does not have a 
speed; it only has a speed relative to something else. 

Second, even if the question asked for the speed with respect to, say, a piece of 
stellar dust, the answer would be “no.” Uniform speed is not measurable from 
within the spaceship. Acceleration, on the other hand, is measurable (assuming 
there is no gravity around to confuse it with). 

10. If you move at the speed of light, what shape does the universe take in your 
frame? 

Answer: The question is meaningless, because it is impossible for you to move 
at the speed of light. A meaningful question to ask is: What shape does the 
universe take if you move at a speed very close to c? The answer is that in your 
frame everything would be squashed along the direction of your motion. Any 
given region of the universe would be squashed down to a pancake. 
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11. Two objects fly toward you, one from the east with speed u, and the other 
from the west with speed v. Is it correct that their relative speed, as measured 
by you, is u + u? Or should you use the velocity-addition formula, V = 
(u + v)/(1 + uv/c 2 )7 Is it possible for their relative speed, as measured by you, 
to exceed c? 

Answer: Yes, no, yes, to the three questions. It is legal to simply add the two 
speeds to obtain u + v. There is no need to use the velocity-addition formula, 
because both speeds here are measured with respect to the same thing, namely 
you. It is perfectly legal for the result to be greater than c (but it must be less 
than 2 c). 

You need to use the velocity-addition formula when, for example, you are given 
the speed of a ball with respect to a train, and also the speed of the train with 
respect to the ground, and your goal is to find the speed of the ball with respect 
to the ground. The point is that now the two given speeds are measured with 
respect to different things, namely the train and the ground. 

12. Two clocks at the ends of a train are synchronized with respect to the train. 
If the train moves past you, which clock shows the higher time? 

Answer: The rear clock shows the higher time. It shows Lv/c 2 more than the 
front clock, where L is the proper length of the train. 

13. A train moves at speed 4c/5. A clock is thrown from the back of the train 

to the front. As measured in the ground frame, the time of flight is 1 second. 
Is the following reasoning correct? “The 7 -factor between the train and the 
ground is 7 = — ( 4 / 5) 2 — 5/3. And since moving clocks run slow, the 

time elapsed on the clock during the flight is 3/5 of a second.” 

Answer: No. It is incorrect, because the time-dilation result holds only for 
two events that happen at the same place in the relevant reference frame (the 
train, here). The clock moves with respect to the train, so the above reasoning 
is not correct. 

Another way of seeing why it must be incorrect is the following. A certainly valid 
way to calculate the clock’s elapsed time is to find the speed of the clock with 
respect to the ground (more information would have to be given to determine 
this), and to then apply time dilation with the associated 7 -factor to arrive at 
the answer of 1/7. Since the clock’s v is definitely not 4c/5, the correct answer 
is definitely not 3/5 s. 

14. Person A chases person B. As measured in the ground frame, they have speeds 
4c/5 and 3c/5, respectively. If they start a distance L apart (as measured in 
the ground frame), how much time will it take (as measured in the ground 
frame) for A to catch B1 

Answer: As measured in the ground frame, the relative speed is 4c/5 — 3c/5 = 
c/5. Person A must close the initial gap of L, so the time it takes is L/(c/ 5) = 
5 L/c. There is no need to use any fancy velocity-addition or length-contraction 
formulas, because all quantities in this problem are measured with respect to 
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the same frame. So it quickly reduces to a simple “(rate) (time) = (distance)” 
problem. 

15. Is the “the speed of light is the same in all inertial frames” postulate really 
necessary? That is, is it not already implied by the “the laws of physics are 
the same in all inertial frames”? 

Answer: Yes, it is necessary. It turns out that nearly all the results in relativity 
can be deduced by using only the “the laws of physics are the same in all inertial 
frames” postulate. What you can find (with some work) is that there is some 
limiting speed (which may or may not be infinite). But you still have to postulate 
that light is the thing that moves with this speed. See Section 10.8. 

16. Imagine closing a very large pair of scissors. It is quite possible for the point 
of intersection of the blades to move faster than the speed of light. Does this 
violate anything in relativity? 

Answer: No. If the angle between the blades is small enough, then the tips 
of the blades (and all the other atoms in the scissors) can move at a speed well 
below c, while the intersection point moves faster than c. But this does not 
violate anything in relativity. The intersection point is not an actual object, so 
there is nothing wrong with it moving faster than c. 

We should check that this setup cannot be used to send a signal down the 
scissors at a speed faster than c. Since there is no such thing as a rigid body, 
it is impossible to get the far end of the scissors to move right away, when you 
apply a force with your hand. The scissors would have to already be moving, in 
which case the motion is independent of any decision you make at the handle 
to change the motion of the blades. 

17. Two twins travel away from each other at relativistic speed. The time-dilation 
result from relativity says that each twin sees the other’s clock running slow, 
so each says the other has aged less. How would you reply to someone who 
asks, “But which twin really is younger?” 

Answer: It makes no sense to ask which twin really is younger, because the two 
twins aren’t in the same reference frame; they are using different coordinates to 
measure time. It’s as silly as having two people run away from each other into 
the distance (so that each person sees the other become very small), and then 
asking: Who is really smaller? 

18. The momentum of an object with mass m and speed v is p = jmv. “A photon 
has zero mass, so it should have zero momentum.” Correct or incorrect? 

Answer: Incorrect. True, m is zero, but the 7 factor is infinite because v = c. 
Infinity times zero is undefined. A photon does indeed have momentum, and it 
equals E/c (which equals hv/c, where v is the frequency of the light). 


19. It is not necessary to postulate the impossibility of accelerating an object to 
speed c. It follows as a consequence of the relativistic form of energy. Explain. 
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Answer: E = 'jmc 2 , so if v = c then 7 = 00 , and the object must have an 
infinite amount of energy (unless m = 0, as for a photon). All the energy in the 
universe, let alone all the king’s horses and all the king’s men, can’t accelerate 
something to speed c. 
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14.9 Appendix I: Lorentz transformations 

In this Appendix, we will give an alternate derivation of the Lorentz transforma¬ 
tions, eqs. (10.13). The goal here is to derive them from scratch, using only the 
two postulates of relativity. We will not use any of the results derived in Section 
10.2. Our strategy will be to use the relativity postulate (“all inertial frames are 
equivalent”) to figure out as much as we can, and to then invoke the speed-of-light 
postulate at the end. The main reason for doing things in this order is that it will 
allow us to derive an very interesting result in Section 10.8. 

As in Section 10.3, consider a coordinate system, S', moving relative to another 
system, S (see Fig. 14.9). Let the constant relative speed of the frames be v. Let 
the corresponding axes of S and S' point in the same direction, and let the origin 
of S' move along the x-axis of S, in the positive direction. 

As in Section 10.3, we want to find the constants, A, B, C, and D, in the 
relations, 

Ax = A Ax' + B At', 

At = CAt' + DAx'. (14.76) 

The four constants here will end up depending on v (which is constant, given the 
two inertial frames). 

There are four unknowns in eqs. (14.76), so we need four facts. The facts we 
have at our disposal (using only the two postulates of relativity) are the following. 

1. The physical setup: S' travels at speed v with respect to S. 

2. The principle of relativity: S should see things in S' in exactly the same way as 
S' sees things in S (except for perhaps a minus sign in some relative positions, 
but this is just convention). 

3. The speed-of-light postulate: A light pulse with speed c in S' also has speed c 
in S. 

The second statement here contains two independent bits of information. (It con¬ 
tains at least two, because we will indeed be able to solve for our four unknowns. 
And it contains no more than two, because then our four unknowns would be over¬ 
constrained.) The two bits that are used depend on personal preference. Three that 
are commonly used are: (a) the relative speed looks the same from either frame, (b) 
time dilation (if any) looks the same from either frame, and (c) length contraction 
(if any) looks the same from either frame. It is also common to recast the second 
statement in the form: The Lorentz transformations are equal to their inverse trans¬ 
formations (up to a possible minus sign). We’ll choose to work with (a) and (b). 
Our four independent facts are then: 

1. S' travels at speed v with respect to S. 

2 . S travels at velocity —v with respect to S'. The minus sign here is due to the 
convention that we picked the positive x-axes of the two frames to point in 
the same direction. 


y y' 



Figure 14.9 



XIV-30 


CHAPTER 14. APPENDICES 


3. Time dilation (if any) looks the same from either frame. 

4. A light pulse with speed c in S' also has speed c in S. 

Let’s see what these imply, in the above order. 14 

• (1) says that a given point in S' moves at speed v with respect to S. Letting 
x' = 0 (which is understood to be Ax' = 0, but we’ll drop the A’s from here 
on) in eqs. (14.76) gives x/t = B/C. This must equal v. Therefore, B = vC, 
and the transformations become 

x = Ax 1 + vCt ', 

t = Ct' + Dx'. (14.77) 

• (2) says that a given point in S moves at velocity —v with respect to S'. 
Letting x = 0 in the first of eqs. (14.77) gives x'/t 1 = —vC/A. This must 
equal —v. Therefore, C = A. and the transformations become 

x = Ax' + vAt', 

t = At' + Dx'. (14.78) 


• (3) can be used in the following way. How fast does a person in S see a clock 
in S' tick? (The clock is assumed to be at rest with respect to S'.) Let our 
two events be two successive ticks of the clock. Then x' = 0, and the second 
of eqs. (14.78) gives 

t = At'. (14.79) 

In other words, one second on S h s clock takes a time of A seconds in 5"s frame. 
Consider the analogous situation from S h s point of view. How fast does a 
person in S' see a clock in S tick? (The clock is now assumed to be at rest 
with respect to S, in order to create the analogous setup. This is important.) 
If we invert eqs. (14.78) to solve for x' and t' in terms of x and t, we find 

x' 
t' 


x — vt 
A — vD ’ 

At — Dx 
A(A - vD) ' 


(14.80) 


Two successive ticks of the clock in S satisfy x = 0 , 
(14.80) gives 


A — vD 


the second of eqs. 

(14.81) 


In other words, one second on S’s clock takes a time of 1/(A — vD ) seconds 
in S"’s frame. 


14 In what follows, we could obtain the final result a little quicker if we invoked the speed-of-light 
fact prior the time-dilation one. But we’ll do things in the above order so that we can easily carry 
over the results of this Appendix to the discussion in Section 10.8. 
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Both eqs. (14.79) and (14.81) apply to the same situation (someone looking 
at a clock flying by). Therefore, the factors on the right-hand sides must be 
equal, that is, 


A = - 


" ‘(-.is- 


A — vD 

Our transformations in eqs. (14.78) therefore take the form 
x = A(x' + vt'), 


(14.82) 


(14.83) 


i (4) may now be used to say that if x 1 = ct ', then x = ct. In other words, if 
x' = ct', then 


A((ct') + vt') 


1 + n§“ 


0J 


Solving for A gives 


(14.84) 


(14.85) 


y/\ — v z /c 2 

We have chosen the positive square root so that the positive x and x' axes 
point in the same direction. 

The constant A is commonly denoted by 7 , so we may finally write our Lorentz 
transformations, eqs. (14.83), in the form, 


x = y(a: / + vt 1 ), 
t = 7 (t' + vx'/(?) 


\J\ — v 2 /c 2 ’ 


(14.86) 

(14.87) 


in agreement with eq. (10.13). 
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14.10 Appendix J: Resolutions to the twin paradox 

We have discussed the twin paradox in Chapters 10 and 13, in both the text and in 
various problems. To summarize, the twin paradox deals with twin A who stays on 
the earth , 15 and twin B who travels quickly to a distant star and back. When they 
meet up again, they discover that B is younger. This is true because A can use the 
standard special-relativistic time-dilation result to say that B 's clock runs slow by 
a factor 7 . 

The “paradox” arises from the fact that the situation seems symmetrical. That 
is, it seems as though each twin should be able to consider herself to be at rest, so 
that she sees the other twin’s clock running slow. So why does B turn out to be 
younger? 

The resolution of the paradox is that the setup is in fact not symmetrical, because 
B will have to turn around and will thus undergo acceleration. She will therefore 
not always be in an inertial frame. Therefore, she cannot apply the simple special- 
relativistic time-dilation result. 

While the above reasoning is sufficient to get rid of the paradox, it is not quite 
complete, because (a) it does not explain how the result from £>’s point of view 
quantitatively agrees with the result from A ’s point of view, and (b) the paradox can 
actually be formulated without any mention of acceleration, in which case slightly 
different reasoning applies. 

Below is a list of all the complete resolutions I can think of. The descriptions 
are terse, but I refer you to the specific problem or section in the text where things 
are discussed in more detail. Of course, many of these resolutions are simply slight 
variations of each other, so it isn’t quite clear whether some of them should count 
as separate resolutions. But here’s my list: 

1. Head-start effect: Let the distant star be labeled as C. Then on the 
outward part of the journey, B sees O’s clock ahead of A ’s by Lv /c 2 , because 
C is the rear clock in the universe as the universe flies by. But after B turns 
around, A becomes the rear clock and is therefore now ahead of C. This means 
that A’s clock must jump forward very quickly, from B's point of view. (See 
Problem 10.2 and Section 10.2.1.) 

2. Looking out the portholes: Imagine many clocks lined up between the 
earth and the star, all synchronized in the earth-star frame. And imagine 
looking out the portholes of the spaceship and making a movie of the clocks 
as you fly past them. Although you see each individual clock running slow, 
you will see the “effective” clock in the movie (which is really many successive 
clocks) running fast. This effect is simply a series small applications of the 
head-start effect mentioned above. 

3. Minkowski diagram: Draw a Minkowski diagram with the axes in A’s 
frame perpendicular. Then the lines of simultaneity (that is, the successive 

15 We should actually have A floating in space, to avoid any GR time-dilation effects from the 
earth’s gravity. But if B travels quickly enough, the SR effects will dominate the GR ones. 
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x-axes) in B's frame will be titled in different directions for the outward and 
inward parts of the journey. The change in the tilt at the turnaround causes a 
large amount of time to advance on T's clock, as measured in EL s frame. (See 
Section 10.5 and Figure 10.50.) 

4. General Relativistic turnaround effect: The acceleration that B feels 
when she turns around may equivalently be thought of as a gravitational field. 
Twin A on the earth is then high up in the gravitational field, so A sees her 
clock run very fast during the turnaround. This causes T’s clock to show more 
time in the end. (See Problem 13.9.) 

5. Doppler effect: By equating the total number of signals one twin sends out 
with the total number of signals the other twin receives, we can relate the 
total times on their clocks. (See Exercise 10.32.) 
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14.11 Appendix K 

: Physical constants and data 

Earth 

Mass 

M e = 5.98 • 10 24 kg 

Mean radius 

Re = 6.37 • 10 6 m 

Mean density 

5.52 g/cm 3 

Surface acceleration 

g = 9.81 m/s 2 

Mean distance from sun 

1.5 • 10 11 m 

Orbital speed 

29.8 km/s 

Period of rotation 

23 h 56 min 4 s = 8.6164 • 10 4 s 

Period of orbit 

365 days 6 h = 3.16 • 10 7 s 

Moon 

Mass 

M l = 7.35 • 10 22 kg 

Radius 

R l = 1.74- 10 6 m 

Mean density 

3.34 g/cm 3 

Surface acceleration 

1.62 m/s 2 « g/6 

Mean distance from earth 

3.84 • 10 8 m 

Orbital speed 

1.0 km/s 

Period of rotation 

27.3 days = 2.36 • 10 6 s 

Period of orbit 

27.3 days = 2.36 • 10 6 s 

Sun 

Mass 

M s = 1.99- 10 3 ° kg 

Radius 

Rs = 6.96 • 10 8 m 

Surface acceleration 

274 m/s 2 « 28 g 

Fundamental constants 

Speed of light 

c = 2.998 • 10 8 m/s 

Gravitational constant 

G = 6.673 • lO" 11 N m 2 /kg 2 

Planck’s constant 

h = 6.63 • 10 -34 J s 

Electron charge 

e= 1.602- lO" 19 C 

Electron mass 

m e = 9.11 ■ 10“ 31 kg = 0.511 MeV/c 2 

Proton mass 

m v = 1.673 • lO" 27 kg = 938.3 MeV/c 2 

Neutron mass 

m n = 1.675 • lO" 27 kg = 939.6 MeV/c 2 
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